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TREATMENT OF MAMMALIAN MYOCARDIUM WITH 
MORPHOGEN LOCALLY, OR WITH MORPHOGENICALLY-TREATED 
MYOGENIC PRECURSOR CELLS 



Field of the Invention 

The present invention relates generaUy to methods and preparations for the treatment of 
mammals, inchiding humans, at risk of, or aflBicted with, loss of or damage to myocardium. The 
methods involve the implantation of mammalian myogenic precursor cells treated with certain 
morphogens, inducers of those morphogens, agonists of the coiresponding moiphogen receptors, 
or with small molecule morphogehic activators. 

Background of the Invention 
Unlike skeletal muscle or smooth muscle, adult mammalian cardiac muscle has extremely 
limited powers of growth and regeneration During development, the myocardium arises by end- 
to-end fusion of myogenic precursor cells to form branched myofibers in which individual cardiac 
myocytes are joined by intercalated disks. The myogenic precursor cells which give rise to the 
myocardium are derived from the splanchic mesoderm, which is derived from the lateral 
mesodermal mesenchyme which, in turn, arises from the mesoderm fomed after gastrulation. It is 
generally believed that there are no remaining myogenic precursor cells in aduh mammaUan 
myocardium and. therefore, lost or damaged myocardium ts typically replaced by fibrotic or scar 
tissue, rather than new myocardium. See, generaHy, B.M. Carlson, ed. (1981) Patten's 
Foundations of Rmbiyology, 4th F^ition . McGraw-Hill, New York. As a result, damage or loss 
of myocardhim due, for example, to myocardial infarction, congestive heart failure, physical 
tnuima (e.g., in an automobile accident), or infection, typically results in a permanent and often 
progressive loss of functional myocardium. 

In contrast, mammalian skeletal musde has much greater capacity for growth and 
regeneration, even in adulthood. Like the myocardium, skeletal muscle has its first origins after 
the induction of the mesodenn. After differentiation of the mesoderm into dorsal, intennediate. 
and lateral mesoderm, the dorsal mesodermal mesenchyme differentiates to form myotomes 
which, in turn, differentiate to form the myogenic precursor ceils which ultimately fonn skeletal 
muscle. Unlike the myogenic precursor cells of the heart, the skeletal muscle precursors fuse 
side-to-side to form unbranched. multinucleated myofibers. Significantly, some portion of the 
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skeletal myogenic precursor ceUs do not differentiate into myocytes but, rather, attach to the 
plasmalemmas of the myocytes. These cells may remain, throughout adulthood, as largely 
undifferentiated, quiescent skeletal muscle "satellite cells." Upon injury of a skeletal muscle, 
however, these satellite cells are revealed to be myogenic precursor cells, or muscle "stem cells," 
whichproliferateanddifferentiatelntonewandfunctionalskeletalmuscle. Even after injury, 
however, a portion of the proliferated satelUte cells remab undiflFerentiated and attach to the 
newly formed myofibers. Thus, the sateUite cells of skeletal muscle provide a constant and 
renewable source of myogMic precursor ceUs which allows for skeletal muscle repair and 
r^eneration throughout mammalian life. 

The proliferation and differentiation of skeletal muscle satellite cells has been extensively 
studied iiudtm. For example, a simple saline extract of skeletal muscle has been shown to cause 
satellite cells to proliferate in culture (Bischoflr(l 989) in Myoblast Transfer Thpr.py Griggs and 
Karpati, eds., pp. 147-158). Similarly, it has been shown that chick embryo extract or the 
conditioned medium of differentiated myotubes from young mice exhibits a strong mitogenic 
effect on satellite cells, but that conditioned medium from older murine myotubes has a lesser 
effect (Mezzogiomo et al. (1993) Mech. Ageing & Dev^lnp 70:35-44). In addition, a number of 
hormones and growth factors have been found to enhance satellite cell proliferation, inchiding 
FGF, PDGF. ACTH, LIF, and IGF (Bischoflf (1989); Mezzogiomo et al. (1993)). Conversely, 
TGF-p, is widely beUeved to inhibit satellite cell proliferation, as does contact with the myofiber 
plasmalemma, but not the basal lamina (Bischoff (1989); but see Hathaway et al. (1991) J. Cell 
Phvsiol. 146:435-441). 

Curiously, in a rat model of skeletal muscle injury, it was found that there were signs of 
sateUite cell differentiation before there were significant signs of sateUite ceU proliferation 
(Rantanen et al. (1995) Lab. Inyest. 72:341-347). This suggests the possibility that there are two 
populations of skeletal muscle sateUite ceUs: "committed satellite cells" which respond to injury by 
rapidly differentiating to replace the injured tissue, and "stem satellite cells" which respond more 
slowly by proliferating and. perhaps, renewing the committed satellite ceU population. In this 
scenario, the.stem satellite cells may undergo mitosis to produce one daughter cell which remains 
a stem satellite cell, and another which becomes a committed satellite cell. 
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In another animal model, autologous mouse skeletal muscle cells were explanted from a 
healthy muscle, proliferated iiudtro. and then implanted into a necrotized skeletal muscle site 
(Alameddine and Fardeau (1989) in Myoblast Transrf^r Th,.r«py Griggs and Karpati, eds., pp. 
159-1 66). In these experiments, it was shown that the transplanted satellite cells were able to 
populate the necrotized area and diffa-entiate into functional myotubes. Similarly, PCT 
Publication WO 96^8541 disdoses that histocompatible donor mouse myoblasts can be 
implanted into the weakened muscle of a mouse model of muscular dystrophy and differentiate 
into myofibers. In addition, it is shown that growth of the myoblasts in bFGF resuhs in 
significantly more new myofibers at the implant site. Thus, skeletal muscle satellite cells, 
proliferated nudtro, may be able to seive as a source of myogenic precursor cells for muscle 
restoration or regeneration therapy. 

The abiUty of skeletal muscle satellite cells to restore or regenerate injured skeletal muscle, 
has led some researchers to test whether myogenic precursor ceUs could be used to replace lost or 
damaged myocardial muscle. For example, mouse fetal cardiomyocytes, which are not terminally 
15 differentiated and retain the ability to divide, have been directly injected into the myocardmm of a 
syngeneic adult mouse, and have been shown to form new and apparently fimctional myocardium 
(Soonpaa et al. (1994) Science 264:98-101). Significantly, it has been shown that skeletal muscle 
satelUte cells, explanted from aduh canine skdetal muscle can be prolifearated in vitro and 
implanted into a site of myocardial cryoinjury. where they appear to differentiate into "cardiac- 
like" muscle celU, possibly in response to morphogenic signals present in the myocardium (Chhi et 
al. (1995^ Ann. Thorac Surg fin-19-l«) 
Morohopen s and Growth Factors 

A great many proteins have now been identified which appear to act as morphogenetic or 
growth factors, regulating cell proliferation and/or diiferentiation. Typically these growth factors 
exert their effects on specific subsets of cells and/or tissues. Thus, for example, epidermal growth 
fiictors, nerve growth factors, fibroblast growth fectors, various hormones, and many other 
proteins indudng or mhibiting cell proliferation or differentiation have been identified and shown 
to affect some subset of cells or tissues. 

One group of morphogenetic protdns, referred to herdn as "morphogens." inchides 
members of the family of bone morphogenetic proteins (BMPs) which were initially identified by 
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their ability to induce ectopic, endochondral bone morphogenesis. Subsequent characterization of 
the nucleic acid and amino acid sequences of the BMPs has shown them to be a subgroup of the 
TGFp superfamily of growth and dififerentiation fectors. Members of the moiphogen family have 
now been shown to inchide the mammalian osteogenic proteinl (OPl, also known as BMP7), 
osteogenic protein2 (0P2), osteogenic protein3 (0P3), BMP2 (also known as BMP2A or 
CBMP2A), BMP3, BMP4 (also known as BMP2B or CBMP2B), BMPS, BMP6. Vgrl, and 
CflJFl, as well as the Xenopus homologue Vgl and the Drosoohila homologues DPP and 60A. 
Members of this femily encode secreted polypeptides that share common structural features, and 
that are similarly processed from a pro-protein to yield a carboxy terminal mature protein of 
approximately 100- no amino adds. All members share a conserved pattern of cysteines in this 
domain and the active form of these proteins is either a disulfide-bonded homodimer of a single 
family member, or a heterodimer of two different members (see, e.g., Massague (1990) Annu. 
Rev, Cell Biol. 6:597; Sampath, et al. (1990) J. Biol. Chem. 265 :1319«) 

The members of the morphogen family of proteins are expressed naturally in a variety of 
tissues during development. BMP-2 (i.e., BMP-2A), for example, is expressed in embryonic 
mouse hair follicles, cartilage and bone (Lyons et al. (1989) Genes & Develop 1 Ml-ififiny 
BMPS has been shown to be most highly eq>ressed in human embiyonic lung and kidney, highly 
expressed in intestinal mucosa and skeletal tissues such as the pwichondrium and periosteum, 
0q)ressed in brain, but undetectable in embryonic heart and liver (Vukicevic et al. (1994) JL 
Histochepi, Cytochgm, 42:869-875); BMP4 has been shown to be expressed in the developing 
limbs, heart, fadal processes and condensed mesenchyme assodated with early whisker follicles in 
embryonic mice (Jones, et al. (1991) Development 1 1 1 :531-542); and OPl (BMP7) has been 
shown immunohistochranically to be present in human embryos in sclerotome, hypertrophied 
chondroqrtes, osteoblasts, paiosteum, adrenal cortex, renal convoluted tubules, placenta, 
smooth, cardiac and skeletal muscles, meninges and neural cells, as well as the basement 
membranes of the lungs, pancreas and skin (Vukicevic, et al. (1994) Biochem. Biophvs. Res 
Commun. 198:693-700). Some of the morphogens (e.g., 0P2 and BMP2) were not detected in 
analyses of adult tissues, suggesting only an early developmental role for these morphogens 
(Ozkaynak, et al. (1992) J. Biol. Chem 267:25220-25227). 
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Although, as noted above; several morphogens have been shown to be expressed in 
embryonic or adult mammalian heart tissue, and various utiUties for the morphogens have been 
proposed and developed, it has never previously been shown or suggested that treatment of 
myogenic precursor cells with the morphogens, morphogen inducers, agonists of moiphogen 
5 receptors, or small molecule morphogenic activators is useful in promoting the proUferation 
and/or differentiation of myogenic precursor cells into new and functional myocardium in a 
morphogenically permissive environment. Nor has it previously been shown or suggested that 
morphogenically-treated myogenic precursor cells are useful in the treatment of lost or damaged 
mammalian myocardium. 

^® Summarv of the Invention 

The present invention is directed to methods of treatment, and pharmaceutical 
preparations for use in the treatment, of mammalian subjects at risk of, or afflicted with, loss of or 
damage to myocardhim. Such subjects include subjects already afflicted with the loss of 
myocardial tissue, such as those which have aheady suffered a myocardial infarction, physical 

15 trauma to the heart (e.g., in an automobile accident, or those already suffering from congestive 
heart failure, as well as subjects reasonably expected to suffer from myocardial infarction or 
congestive heart failure. Whether a particular subject is at risk is a determination which may 
routinely be made by one of ordinary skill in the relevant medical or veterinary art. 

In these methods of treatment, myogenic precursor ceUs are implanted into a mammal at a 

20 site at risk of, or afflicted with, loss of or damage to myocardium, and the myogenic precursor 
cells are morphogenically-treated prior to, simultaneously with, or subjert to implantation. Thus, 
for example, morphogenicalfy-treated mammalian myogenic precursor ceUs may be implanted into 
a mammalian heart at the sute of a myocardial inferct, or uito the damaged or weakened 
myocardium of a subject with congestive heart Mure. The mammalian myogenic precursor cells 

25 may be derived from skeletal muscle (e.g., skeletal muscle satelUte cells), from embryonic tissue 
(e.g., embryonic mesodermal mesenchyme) or from a myogenic precursor cell line maintained in 
jdtto. Thus, the myogenic precursor cells may be derived from a donor (e.g., a tissue-type 
matched donor, sibling, identical twin, or fetus), may be derived from a tissue culture (e.g.. 
undifferentiated or parUy undifferentiated myogenic cells in culture; fetal tissue culture), or may 

30 be explanted from the subject and re-implanted after morphogen-induced proliferation and/or 
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diflFerentiation. Finally, the morphogenic treatment of the implanted cells may inchide treatment 
of the cells with a morphogen, morphogen inducer, agonist of a morphogen receptor, or small 
molecule morphogenic activator prior to implantation, simultaneously with implantation, or 
subsequent to implantation. 

The present invention is further directed to methods of promoting the proliferation and 
diflFerentiation of mammaUan myogenic precursor ceUs in vivo or in vitro Thus, for example, 
myogenic precursor cells isolated from mammalian skeletal muscle tissue, embryonic myogenic 
precursor cells, or myogenic precursor cell lines, may be stimulated to proliferate by treatment 
with a morphogen, an inducer of a morphogen, an agonist of a morphogen receptor, or a small 
molecule morphogenic activator. Alternatively, or in addition, mammalian myogenic precursor 
cells may be stimulated to differentiate into myocytes, particularly myocytes which express 
markers of myocardial tissue, m a morphogenically permissive environment. 

The present invention is further directed to therapeutic preparations comprising isolated 
mammalian myogenic precursor cells and an amount of a morphogen, inducer of a morphogen, 
agonist of a morphogen receptor, or small molecule morphogenic activator sufficient to promote 
proUferation or differentiation of the myogenic precursor cells in a morphogenically permissive 
environment. 

The methods and compositions of the present invention capitalize in part upon the feet 
that certain proteins of eukaryotic orijgin, defined herein as morphogens, may be used to treat 
myogenic precursor cells such that, when these morphogenicaUy-treated myogenic precursor cells 
are present in a morphogenically permissive environment, they may migrate, proliferate and/or 
differentiate so as to form new and functional myocardium. In particular, the present invention is 
based in part upon the feet that treatment of myogoiic precursor ceUs with these morphogens 
enhances or increases the probability, rate, or efficiency with which these cells migrate, proliferate 
ancl/or differentiate into new and fimctional myocardium in a morphogenically permissive 
environment. Thus, in accordance with the present invention, morphogenically-treated myogenic 
precursor ceUs may be used to restore or regenerate lost or damaged myocardium in a mammal, 
or to prophylactically treat a mammal at risk of such loss or damage. The present invention is 
novel in that myocardial tissue is believed to lack a sufficient number of myogenic precursor cells 
for adequate regeneration or repair of lost or damaged tissue and, therefore, the ability of the 
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morphogens to promote the migration, proliferation and/or differentiation of myogenic precursor 
cells (e.g., skeletal muscle satellite ceUs) into functional myocardium is unexpected. 

In preferred embodiments, the morphogen is a dimeric protein comprising a pair of folded 
polypeptides, each having an amino add sequence that shares a defined relationship with an amino 
5 acid sequence of a reference morphogen. Preferred morphogen polypeptides share a defined 
relationship with a sequence present in morphogenicalty active human OP-1 (SEQ ID NO: 4). 
However, any one or more of the naturafly occurring or biosynthetic sequences disclosed herein 
similarly could be used as a reference sequence. Preferred morphogen polypeptides share a 
defined relationship with at least the C-terminal six cysteine domain of human OP-I (residues 43- 

10 139 of SEQ ID NO: 4). Preferably, morphogen polypeptides share a defined relationship with at 
least the C-terminal seven cysteine domain of human OP-1 (residues 38-139 of SEQ ID NO: 4). 
That is, preferred morphogen polypeptides in a dimeric protein with morphogenic activity each 
comprise a sequence that corresponds to a reference sequence or is functionally equivalent 
thereto. Examples of preferred morphogens inchjde mammalian, and particularly human, OP-1, 

15 CBMP-2A (BMP-2) and CBMP.2B (BMP-4). 

Functionally equivalent sequences include functionally equivalent arrangements of cysteine 
residues disposed within the reference sequence, including amino acid insertions or deletions 
which alter the linear arrangement of these cysteines, but do not materially impair their 
relationship in the folded structure of the dimeric morphogen protein, including their ability to 

20 fonn such intra- or inter-chain disulfide bonds as may be necessary for morphogenic activity. 

Functionally equivalent sequoices fimher include those wherein one or more amino acid residues 
diflFers fi-om the corresponding residue of a reference mi>rphogen sequence, e.g., the C-terminal 
seven cysteine domain (also referred to herein as the conserved seven cysteine skeleton) of human 
OP-1, provided that this difference does not destroy morphograic activity. Accordingly, 

25 conservative substitutions ofcorresponding amino acids in the reference sequence are preferred. 
Amino acid residues that are "conservative substitutions" for corresponding residues in a 
reference sequence are those that are physically or fiinctionally similar to the corresponding 
reference residues, e.g., that have amilar size, shape, electric charge, chemical properties 
including the ability to form covalent or hydrogen bonds, or the like. Particularly preferred 

30 conservative substitutions are those fulfiUing tiie criteria defined for an "accepted point mutation" 
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in Dayhoff, et al. (1978) Atlas of Protein Sequence and Stn.rh.rP 5: Suppl. 3, ch. 22 (pp. 354- 
352), Natl. Biomed. Res. Found., Washington, D.C. 20007, the teachings of which are 
incorporated by reference herdn. 

In certain embodiments, a polypeptide suspected of being functionally equivalent to a 
reference morphogen polypeptide is aligned therewith using the method of Needleman, et al. 
(1970) J. Mol. Biol. 48:443-453, implemented com^eniently by computer programs such as the 
AUgn program (DNAstar, Inc.). As noted above, internal gaps and amino acid insertions in the 
candidate sequence are ignored for purposes of calculating the defined relationship, 
conventionaUy expressed as a level of amino acid sequence homology or identity, between the 
candidate and ref«^ence sequences. "Amino acid sequence homology" is understood herein to 
inchide both amino acid sequence identity and similarity. Homologous sequences share identical 
and/or similar amino acid residues, where similar residues are conservative substitutions for, or 
"aUowed point mutations" of, corresponding amino acid residues in an aligned reference 
sequence. Thus, a candidate polypeptide sequence that shares 70% amino acid homology with a 
reference sequence is one in which any 70% of the aligned residues are either identical to. or are 
conservative substitutions of, the corresponding residues in a reference sequence. 

The present invention alternatively can be practiced with methods and compositions 
comprising a morphogen inducer in lieu of a morphogen. A "morphogen inducer" is a compound 
that stimulates the production (i.e.. transcription, translation, and/or secretion) of moiphogen by a 
cell competent to produce and/or secrete a morphogen encoded within the genome of the cell. 
Endogenous or administered morphogens can act as endocrine, paracrine or autocrine factors. 
Therefore, an inducer of a morphogen may stimulate endogenous moiphogen synthesis by the 
cells in which the morphogenetic responses are induced, by neighboring cells in vivo or in vitro 
(e.g., in tissue culture) or by cells of a distant tissue invivb (in which case the secreted 
morphogen is transported to the site of morphogenesis, e.g., by the individual's bloodstream). In 
preferred embodiments, the inducer stimulates expression and/or secretion of a morphogen so as 
to increase amounts thereof available to mammalian myogenic precursor cells in vivo or in vitro . 
Thus, to promote the migration, proliferation and/or differentiation of myogenic precursor cells in 
vivo, an inducer of a morphogen may be administered to induce production of morphogen by the 
myogenic precursor cells themselves, or by other cells co-cultured with the myogenic precursor 
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cells. Similarly, to promote the proliferation and/or differentiation of myogenic precursor cells in 
vivo, an inducer of a morphogen may administered locally or systemically to induce morphogen 
production by the myogenic precursor cells themselves, or by neighboring or distant cells in a 
mammal's body. 

In stiU other embodiments, an agent which acts as an agonist of a morphogen receptor 
may be administered instead of the morphogen itself. An "agonist" of a receptor is a compound 
which binds to the receptor, and for which the result of such binding is similar to the insult of 
binding the natural, endogenous ligand of the receptor. That is, the compound must, upon 
interaction with the.receptor. produce the same or substantially similar transmembrane and/or 
intracellular eflRjcts as the endogenous ligand. Thus, an agonist of a morphogen receptor binds to 
the receptor and such binding has the same or a functionally similar result as morphogen binding 
(e.g., induction of morphogenesis). The activity or potency of an agonist can be less than that of 
the natural hgand. in which case the agonist is said to be a "partial agonist," or it can be equal to 
or greater than that of the natural ligand, in which case it is said to be a "full agonist." Thus, for 
example, a small peptide or other molecule which can mimic the activity of a morphogen in 
binding to and activating the morphogen's receptor may be employed as an equivalent of the 
morphogen. Preferably the agonist is a fiiU agonist, but partial morphogen receptor agonists may 
also be advantageously employed. Methods of identifying such agonists are known in the art and 
include assays for compounds which induce morphogen-mediated responses (e.g., induction of 
differentiation of metanephric mesenchyme, induction of endochondral bone formation, and the 
like). Such an agonist may also be referred to as a morphogen "mimic." -mimetic," or "analog." 

Alternatively, a smaU molecule morphogenic activator, as described herein, may be 
administered instead of the morphogen itself to promote the migration, proliferation, and/or 
differentiation of myogenic precursor cells by increasing tiie level of expression of proteins 
associated with myocardial phenotype. Exemplary methods comprise introducing a small 
molecule morphogenic activator that regulates some portion or portions of a moiphogen-induced 
regulatory patiiway, resulting in an eflfective increase in expression or activity of myocardium- 
specific protein. This may resuh either from stimulating an increase in the endogenous expression 
of such protein or from a decrease in the inhibition of normal expression of such protein. For 
example, a small molecule morphogenic activator may act at the type I or type II morphogen 
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receptor; or at the serine/threonine kinase, or other kinase domains of those receptors. Another 
target of pathway activation is the Smad proteins, including the monomeric, dimeric (including 
heteromeric and homomeric complexes) or trimeric forms (inchiding heteromeric and homomerib 
complexes). Alternately, a smaU molecule morphogenic activator may lead to activation of a 
transcription factor (for example, the X-protein shown in Figure 2) that causes phenotype-spedfic 
gene expression expression of protein duiracteristic of myocardhim). 

Preferably, the morphogens, morphogen inducers, agonists of morphogen receptors, or 
small molecule morphogenic activators are directly contacted with the myogenic precursor cells in 
solution either instofi prior to implantation, in vivo at the time of implantation, or in vivo 
subsequent to implantation. Akernatively, however, the morphogens, morphogen inducers, 
agonists of morphogen receptors may be admmistered by any route which is compatible with the 
selected agent, and may be formulated with any pharmaceutically acceptable carrier appropriate to 
the route of administration. Preferred systemic routes of administration are parenteral and, in 
particular, intravenous and intraperitoneal. 

In additional embodiments, the present mvention provides pharmaceutical compositions 
comprising a morphogen, or morphogen inducer, agonist of a morphogen receptor, or small 
molecule morphogenic activator in combination with one or more of a "muscle extract," 
conditioned medium from differentiated myotubes grown m culture, bFGF, IGF, PDGF, LIF, 
ACTH, MSH, or G-CSF. These compositions are usefiil in promoting the proliferation and/or 
differentiation of myogenic precursor cells. 

Brief Descri ption of the Fi piiwy 

Figure 1. Panels 1-1 through 1-12 of this figure are a tabular alignment of the amino add 
sequences of various naturally occurring morphogens with a preferred referaice sequence of 
human OPl, residues 38-139 of SEQ ID NO: 4. Morphogen polypeptides shown in this figure 
also are identified in the Sequence Listing. 

Figure 2. Figure 2 is a schematic representation of a morphogen-activated regulatory 
pathway for repression of a phenotype-specific gene. 

Detailed De scription of the Invention 

I. Definitions 
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In order to more clearly and concisely point out the subject matter of the claimed 
invention, the following definitions are provided for specific teims used in the following written 
description and appended claims. 

Subjects ^t risk of. or afflicted with, loss of or Ha m aee to mvnr^rA«^ n> As used herein, a 
5 subject (preferably a mammal, e.g., a human) is said to be at risk o^ or afflicted with, loss of or 
damage to myocardium, if the subject has suffered a loss of functional myocardial tissue which is 
clinically detectable in terms of reduced or altered cardiac function, or if the subject may 
reasonably be expected to suffer such a loss. Subjects at risk of, or aflOicted with, loss of or 
damage to myocardmm inchide, but are not limited to, subjects which have ah-eady suffered a 

10 myocardial inferction, which have suffered a physical trauma to the heart (e.g., in an automobile 
accident) which has reduced cardiac function, or which have already been diagnosed with 
congestive heart feihire; as well as subjects which can reasonably be expected to suffer a 
myocardial infarction or congestive heart failure. Whether a particular subject is at risk is a 
determination which may routinely be made by one of ordinary skUl in the relevant medical or 

15 veterinary art. 

Myogenic precursor cells. As used herein, the term "myogenic precursor cells" refers to 
ceUs capable of myogenesis, or the process of proliferation and differentiation into new and 
functional muscle when present in a morphogenically permissive environment. Myogenic 
precursor cells are variously referred to in the literature as "myoblasts," "muscle stem cells" or 
20 "satellite cells." 

MorphoRenicallv permissive environment As used herein, a "morphogenically permissive 
environment" is an environment which allows or promotes the differentiation of cells into a 
specific cell type or types. A "morphogenicaUy permissive environment* is, therefore, suflRcienUy 
fi-ee of inhibitors of cell differentiation to allow or promote ceU differentiation. In addition, a 
25 morphogenically permissive owironment is one which provides signals (e.g., tiirough cell-cell 
contact, cell-extracdhilar matrix contact, or diffusible factors) which allow or promote a 
pluripotent cell to follow a particular morphogenic pathway. In particular, with respect to 
myocardial differentiation, a morphogenically permissive environment includes an environment of 
mtact or damaged myocardial tissue which provides signals to myogenic precursor cells which 
allow or promote the differentiation of those cells into new and functional myocardium. It is 



30 
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known, for example, that myogenic precursor cells differentiate into myocytes at least partly in 
response to contact with the plasmalemma of a myofiber. The presence of myofiber 
plasmalemmas, therefore, may be one element of a morphogenically permissive environment for 
myogenesis. Similarly, electrical or biochemical stimuli from nerves, as well as a variety of 
growth factors (see below), appear to be elements of a morphogenically permissive environment 
for myogenesis. Thus, a morphogenically permissive environment may include one or more of 
these elements. 

n. Description of the Preferred Embodiments 
A. General 

The present invention depends, m part, upon the surprising discovery that 
morphogenically-treated manunalian myogenic precursors cells, when implanted in vivo at a site 
of lost or damaged manmialian myocardium, undergo a process of proliferation and/or 
differentiation to produce new and functional mammalian myocardium, thereby restoring or 
regenerating the lost or damaged tissue in whole or in part. This resuh is particularly unexpected 
in light of the fact that mammalian myocardial tissue is believed to lack a sufficient number of 
myogenic precursor cells for adequate regeneration or repair of lost or damaged tissue and, 
therefore, manmialian myocardium previously has been believed to be a poor responder for 
functional restoration or regeneration after tissue loss or damage. In addition, the presmt 
invention depends, in part, upon the surprising discovery that non-myocardial cells, such as those 
obtained from mammalian skeletal muscle or embryonic myogenic precursor cells, may be induced 
to proliferate and differentiate into myocardium in a morphogenically permissive environment. It 
is further surprising that the morphogens, morphogen inducers, agonists of morphogen receptors, 
and small molecule morphogenic activators, as described herein, may promote such restoration or 
regeneration despite the fact that they have no known role in myocardial tissue restoration or 
regeneration in the aduh manmial. 

Without being bound to any particular theory of the invention, it is believed that the 
morphogens, morphogen inducers, agonists of morphogen receptors, or small molecule 
morphogenic activators may promote the proliferation of myogenic precursor cells and render 
them more susceptible to differentiation into new and functional myocardium when implanted in a 
morphogenically permissive environment. Thus, it is believed that the morphogens, morphogen 
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inducers, agonists of morphogen receptors, or smaU molecule morphogenic activators may 
increase the pluripotentiality of these myogenic precursor cells, such that they may "switch fates" 
and, rather than differentiating only into smooth or skeletal muscle, they may proliferate and then 
differentiate into new and functional myocardiuRL 

B- Isolating and Culturing Mammalian Myogenic Precursor Cells 

Methods of isolating and culturing mammalian myogenic precursor cells are well- 
established in the art. For example, myogenic precursor cells may be obtained, as fiirther 
described in the examples below, by dissociation of skeletal muscle and subsequent culturing of 
the satellite cells. Alternatively, myogenic precursor cells may be obUiined from embryonic 
tissues, where they arise as fetal myoblasts from the myotomes of the somites, after induction of 
the mesoderm. Myogenic precursor cells may also be obtained from cell lines, such as a 
phiripotent mesodermal mesenchyme cell line or a partially dedifferentiated laboratory cell line, 
which may be induced to differentiate into myoblasts after implantation into a morphogenicaJly 
permissive environment. See, generally, Hathaway, et al. (1991) J. Cell. Phvsiol 146:435-441; 
Mezzogiomo et al. (1993) Mech. Ageing & Develop 70:35-44; Alameddine and Fardeau (1989); 
Chiu et al. (1995) Ann. Thorac. Surp 60:12-18. 

1- Isolating M yogenic Precursor Cells from Skeletal Muscle 

In preferred embodiments, the n^ogenic precursor cells are obtained from skeletal muscle. 
The skeletal muscle donor is preferably tiie subject for myocardial treatinent or an identical twn in 
order to avoid problems of histocompatibility and possible tissue rejection. Alternatively, other 
family mranbers or histocompatible donors, inchiding transgenic mammals raised for organ 
ti^plartation purposes (e.g., lacking MHC markers or expressing humanized MHC proteins), 
may be onployed as donors of the skdetai muscle tissue. Depending upon the degree of 
histocompatibility, standard metiiods of inmiunosuppression may be needed in conjunction with 
the present invention to prevoit rejection of the implanted cells. 

Briefly, a sample of skdetai muscle is excised from one or more skeletal muscles of a 
subject under local or general anestfiesia. Any excessive connective tissue and fasciae are 
dissected away, the muscle is rinsed in sterile solution, and the muscle is dissociated by. for 
©cample, mincing with scissors or passage through a meat grinder until substantially 
homogeneous. The amount of muscle excised will depend, of course, upon the quantity of 
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myogenic precursor cells required by the treatment, as well as the degree of myogenic precureor 
cell proliferation which is to be promoted in vitro . Typically, however, amounts of 1-100 grams, 
more preferably 10-50 grams, of skeletal muscle tissue are removed. Such quantities may be 
excised conveniently from one or more of the larger, relatively superficial muscles of the limbs 
5 (e.g., biceps brachii, triceps brachii, brachialis, brachioradialis, rectus femoris, biceps femoris, 
semitendinosus, gracilis, vastus lateralis, gastrocnemius, tibiaUs anterior), chest and shoulders 
(e g., pectoraUs, deltoid), pehns and hips (e.g., gluteus medius, ghiteus maximus), back (e.g., 
trapezius, latissimus dorsi) or abdomen (e.g., obUquus abdominis extemus, rectus abdominis), but 
may be obtained from any available skeletal muscle. 

10 Prefwably, the dissociated muscle then is incubated with a proteolytic enzyme (e.g., 

pronase (Sigma, St. Louis, MO), coUagenase (Sigma, St. Louis, MO), hyaluronidase (Sigma, St. 
Louis, MO), or trypsin (Difco Laboratories, Inc., Detroit, MI) at 37°C for 15 min to 1 hr to 
remove remaining connective tissue. The mass of digested muscle tissue optionally may be 
further dissociated by, for example, repeated pipetting or mixing. In addition, the digested mass 

1 5 optionally may be washed, pelleted and resuspended to remove digested connective tissue and 
enzyme, and any remaining debris may be removed by filti-ation. The cells are then suspended io a 
sterile buffer (e.g., phosphate buffered saline sohition) and centrifaged at approximately 500-550 
g for approximately 10 minutes to sediment the larger, multinucleated skeletal muscle fibers and 
myocytes, while leaving the sateUite ceUs in the supernatant. Kther before or after centoifugation, 

20 serum, such as fetal bovine serum (FBS, GIBCO BRL, Grand Island, NY), may be added to the 
mfarture to hah the enzymatic cleavage process and antibiotics may be added to prevent microbial 
growth. If desired, satellite cells may be separated from fibroblasts and other remaining ceUs 
uang a draisity centrifiigation metiiod (see, e.g.^ Yablonka-Reuvwii and Nameroff (1987) 
Histochemistry 87:27-38). 

25 2. Isolating Myogenic Prec ursor Cells from Embryos 

Myogenic precursors cells may be isolated from mammalian embryonic or fetal (together 
"embryonic") tissues at various stages of development after induction of tiie mesoderm. Thus, for 
example, myogenic precursor cells may be obtained from the embryonic mesoderm prior to its 
fijrtiier differentiation into dorsal, intermediate, and lateral mesodermal mesenchyme. After this 

30 stage of differentiation, any mesodermal ceUs may be employed but, preferably, cells are employed 
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which arise along the routes of differentiation toward skeletal or cardiac muscle. For example, 
the dorsal mesodermal mesenchyme differentiates to form the myotomes which, in turn, 
differentiate to form both the skeletal muscles of the trunk and the limb buds. The mesodermal 
mesenchyme of the limb buds further differentiates to form the skdetal muscles of the appendages 
(as well as the appendicular skeleton. Similarly, the feteral mesodennal mesenchyme 
differentiates, in part, to form the splanchic mesoderm which, in turn. diflFerentiates to form the 
myocardium and smooth muscles of the viscera (as wdl as the gonads, circulatory system and 
other primary elements of the viscera). One of ordinary skill in the art may, therefore, readily 
choose appropriate embryonic cells for use in the present invention (see, e.g., Soonpaa et al. 
(1994) Science 264:98-101; also see. generaUy, B.M. Carlson, ed. (1981) Patten's Foundation, of 
EmbryoloRY, 4th Edition McGraw-HUl, New York). Once excised, the embryonic tissue may be 
treated essentially as described above with respect to skeletal muscle to isolate the myogenic 
precursor cells. 

As with cells obtained from the skeletal muscle of an adult mammal, histocompatibility 
problems may arise upon implantation of embryonic myogenic precursor ceUs. Therefore, 
depending upon the degree of histocompatibility, standard methods of immunosuppression may be 
needed in conjunction with the present invention to prevent rejection of the implanted cells. 
3. Isolating Myogenic Precu rsor Cells fi-nm EstablLshed Cell Lines 

Established cell lines, including myogenic precursor cell lines, myoblast cell lines, or 
mesenchymal ceU lines, may also be employed m the present invention without the need for 
isolation of the myogenic precursor cells from adult or embryonic tissue. For example, the 
established murine myoblast cell line CjC^ (ATCC CRL 1772) has been implanted into mouse 
hearts and shown to differentiate mto functional myocardium and fuse with native myocardium 
(Koh et al. (1993) J. Clin. Invest. 92:1548-54). Alternatively, pbripotent mesodennal stem ceU 
lines, inchiding primaiy dermal fibroblast lines, smooth muscle cell lines, or chondroblast lineages 
may be caused to differentiate into muscle cells (see, e.g., Choi et al. (1990) Proc. Nat. Acad. Sci. 
(USA) 87:7988-7992). Finally, it should be noted that a variety of established mammalian 
myogenic cell lines are commerciaUy available for use in accordance with the present invention 
including, for example, the human cell line fflSM (ATCC CRL 1692), the murine cell lines 
C2C12 (ATCC CRL 1772), NOR-10 (ATCC CRL 197), and G-8 (ATCC CRL 1456), and the rat 
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cell lines A7r5 (ATCC CRL 1444). AlO (ATCC CRL 1476), H9c2 (2-1) (ATCC CRL 1446), L6 
(ATCC CRL 1458) and L8 (ATCC CRL 1 769). FoUowing essentially the same protocols as 
described in the original reports of these ceU lines (see the ATCC's Catalogue of Cell Lines & 
Hybridomas, for citations) one of ordinary skill m the art can readily produce comparable cell 
S lines from any mammalian species. 

4. Culturing Mvoggiic Precursor Cells 
Myogenic precursor cells may be cultured on solid or in Uquid media. Thus, for example, 
the myogenic precursor cells may be suspended in a flask of liquid medhim while maintaining mild 
or periodic agitation. Alternatively, the cells m^ be plated on a solid substrate and fed with a . 

10 liquid medium. Appropriate liquid media are well known in the art and inchide, but are not 
limited to, McCoy's, Ml 99, Minimal Essential Medium (MEM), Dulbecco's Modified Eagle 
Medium (commercially available from, for example, GmCO BRL, Grand Island, NY, or Sigma 
Chemical Company, St. Louis, MO), and the Uke. These media may, of course, be supplemented 
with additional buffers or nutrient sohitions (e.g., 10% fetal bovine serum, 3% horse serum), or 

15 with antimycotics and/or antibiotics (e.g., 50-5,000 lU/ml peniciUin, 50-5,000 ng/ml 

streptomycin, 5-50 ng/ml gentamicin). Prefer^ly, the liquid media is replaced every 24-48 hrs 
and the cultures are maintained at a relatively constant temperature of about 37''C under a normal 
or 5% C02-enridied humid atmosphere. For culturing on solid substrates, cells are preferably 
plated at a density of approximately 10*-10' cells per 60 mm plate. To promote cell adherence to 

20 solid substrates, the plates may optionally be coated with, for example, basement membrane 
matrigel or laminin (Sigma Chemical Company, St. Louis, MO) although, as described below, 
adherence and/or confluence may inhibit prolifo-ation. 

In order to allow or promote prolifisration of the myogenic precursor cells in vitro while 
inhiWting premature differentiation, a number of steps may be taken. For example, myogenic 

25 precursor cell proliferation has been shown to be inhibited by TGF-p (Allen and Boxhom ( 1 989) 
J. Cell Physiol. 138:31 1-315) and contact with myofiber plasmalemmas, (Bischoff(1989)); and 
has been shown to be promoted by a saline "muscle extract" (Bischoff (1986) Dev. Biol. 
115: 140), conditioned medium from differentiated myotubes grown in culture (Mezzogiomo et al. 
(1993) Mech. Ageing & Develop. 70:35-44), basic fibroblast growth factor (bFGF) (Clegg et al. 

30 (1987) J, Cell. Biol. 105:949-56), insulin-like growth factors (IGF) (Ewton and Florini (1977) 
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Endocrinology 106:577-587; Tollfsen et al. (1989) Proc. Nat. Acad. Sci. OJSA't 1543-1547), 
platelet-derived growth factor (PDGF) (Yablonka-Reuveni et al. (1990) J. Cell Biol. 1 1 1 :1623- 
1629), leukemia-inhibiting factor (LIF) (Austin and Burgess (1991) J. Neuro. Sci 101:193-197), 
adrenocorticotrophic hormone (ACTH) (Cossu et al. (1989) Develop. Biol 131:331-336; De 
Angelis et al. (1992) Dev. Biol. 151 :446.458), melanocyte-stitnulating factor (MSH) (Cossu et al. 
(1989) Develop. ^}ql. 131 :33 1-336) and granulocyte colony stimulating factor (G-CSF) (Austin 
and Burgess (1991) J. Neuro. Sci. 101 : 193-197). Thus, in order to promote proliferation of the 
myogenic precursors cells mJDHQ prior to implantation and/or in vivo after implantation, the cells 
may be grown in the presence of one or more of these factors, or other known mitogens. In 
addition, as is generally known in the ait, proliferation of such cells may be promoted by repeated 
passaging (e.g., treatment with dihite trypsin to remove adhered cells from the culture plate and 
replating at a lower density every 2-3 days), growth in liquid culture, growth in the absence of 
enhancers of cdl adhesion, growth in the presence of inhibitors of cell adhesion, and/or growth at 
densities below confluence. 

There is no absolute requirement that the myogenic precursor cells of the present 
invention be cultured injatro prior to implantation. Indeed, if a therapeutically effective number 
of myogenic precursor cells can conveniently and economically be obtained without culturing, this 
step may be omitted. On the other hand, when such cells are in scarce supply (e.g., from fetal 
tissues) or can be obtained only through invasive measures (e.g., excision of substantial portions 
of muscle from a donor or donor/subject), it is preferred that smaller numbers of cells be obtained 
initiaUy, and then proliferated in yjtrft. Doubling times wiU vary depending upon the source of 
cells, media, and the presence or absence of other growth fectors, but doubling times on the order 
of every 12 hrs have been reported in the literature for muscle satellite cells grotvn in the presence 
of muscle abstract (BischofF, (1989)). Therefore, it is contemplated that culturing times of several 
days to a week may be employed in the present methods to expand the myogenic precursor cell 
population prior to implantation. 

Myogenic precursor ceUs may be harvested by brief trypsin treatment to remove any cells 
adhered to the culture plate or vessel, and centrifijgation (e.g., 10-15 min at 500-1000 g). The 
ceUs may then be resuspended in a physiologicaUy acceptable buffer solution (e.g., PBS. Ringer's 
saline) at an appropriate density (e.g., lO^-lo' cells/ml). 
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Finally, it should be noted that morphogens, morphogen inducers, agonists of morphogen 
receptors, and small molecule morphogenic activators may be used to treat the myogenic- 
precursor cells during culturing (if any) to aid in proliferation and/or subsequent diflFerentiation. 
Alternatively, the myogenic precursor cells may be treated with a morphogen, morphogen 
inducer, agonist of a morphogen receptor, or small molecule morphogenic activator either 
simultaneously with, or subsequent to. implantation. In the case of morphogen inducers, the 
myogenic precursor cells may be co-cultured with auxiliary cells which respond to these 
morphogen inducers by producing morphogen. The myogenic precursor ceUs then may be 
impfcmted along with these auxiliary cells, or may be isolated from the co-culture by standard cell 
separation techniques, which are known in the art. but which will vary with the type of auxiliary 
cells employed (e.g., density centrifagation separation, cell type specific cytotoxins). 
C- Implantation of Mvope nic Precursor Cells at a Myocardial Site 
Myogenic precursor cells may be implanted at a site of loss of or damage to mammalian 
myocardhim by any of a variety of surgical techniques known in the art. These techniques range 
from the minimaUy invasive (e.g., injection by needle through the thoracic wall) to substantially 
invasive (e.g., thoracotomy and incision of the myocardium, followed by implantation, sutijring of 
the implant site and closing of the chest). The technique employed in any given instance wUl 
depend upon such factors as the size of tiie myocardial site to be treated, the accessibility of the 
site, and the age and stamina of the subject. 

Generally, the myogenic precursor cells are implanted in a physiologicaUy acceptable 
imffer solution. To minimize the vohmie of sohition administered to the treatinent site, the cells 
may be at a relatively high titer within this solution (e.g.. 10*-10' cdls/ml). The solution may 
contain growth fectors, as described above, to promote fiirtiier proliferation of the myogenic 
precursor ceUs witiiin tiie implant site, or nay be free of sudi factors so as to promote 
25 differentiation into new and functional myocardium in the morphogenically permissive 

environment of tiie myocardial implant site. In addition, as noted above, tiie myogenic precursor 
ceUs may be implanted eitiier simultimeously witii a morphogen, morphogen inducer, agonist of a 
morphogen receptor, or small molecule morphogenic activator, or the morphogenic treatment 
may be subsequent to implantation. 
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Thus, for example, a solution of myogenic precursor cells and a morphogen, morphogen 
inducer, agonist of a morphogen receptor, or small molecule morphogenic activator, may be 
implanted at a site of myocardial infarction in essentiaUy the foHowing manner. For example, to 
treat a myocardial infarct to the anterior wall of the left ventricle, a left thoracotomy is performed 
on a subject under general anesthesia in an intercostal space (e.g., the sixth intercostal space) and 
the site of the infarct is determined by observation. At the discretion of the surgeon, the heart 
may or may not be stopped and systemic blood flow shunted to a heart-lung machine. Myogenic 
precursor ceUs then may be directly injected into one or more sites within the infarct using an 
intravenous catheter (e.g., a 16-gauge Teflon catheter from Criticon, Tampa, FL). The initial 
injection(s) may inchide a morphogen. morphogen inducer, agonist of a morphogen receptor, or 
small molecule morphogenic activator, or tiiese may be included in one or more additional 
injections to the inferct site. Alternatively, a number of non-transmural incisions may be made at 

the site ofthe infarct to create "channels" paraUel to the direction ofthe myocardial fibers. The 
suspension of myogenic precursor cells (with or without morphogen, morphogen inducer or 
1 5 morphogen receptor agonist) then may be introduced within these channels and the channels 
closed by suturing. Finally, the pericardium is sutured and chest wall are closed by standard 
surgical techniques (after restarting and returning systemic circulation to tiie heart from a heart- 
lung machine, if employed). 

The treatment of chronically drteriorating mammalian nqrocardfaim (e.g., due to 
20 congestive heart feUure or chronic myopathy), may be performed simUarly except tiiat the 

implantation sites are chosen to correspond to areas of generalized myocardial deterioration and, 
therefore, may be more difiuse. 

The number of myogenic precursor cells implanted will vary according to the amount of 
ntyocardial tissue to be restored or regenerated. The vohmie of cells to be restored or 
25 regenerated may be ascertained by standard techniques of cardiac imaging. Generally, it is 

expected that on the order of approximately 10*- 10* myogenic precursor cells will be required to 
restore or regenerate 1 mg of myocardial tissue (see, e.g., Alameddine and Fardeau (1 989)). 
^- Morphoeens. Inducers. Agonists, and Small Molecule Morphogenic Activators 
Morphogens useful in the present invention include eukaryotic proteins originally 
30 identified as osteogenic proteins (see U.S. Patent 5,01 1.691, incorporated herein by reference). 
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such as the OPl, OP2, 0P3, CBMP2A (BMP-2X CBMP-2B (BMP-4) and BMP3 proteins (SEQ 
ID NOs: 4-9, 15-22, 25-27), as well as amino acid sequence-related proteins such as DPP (SEQ 
ID NO; 10, from Drosophila), Vgl (SEQ ID NO: 1 1. from Xenopus^ Vgrl (SEQ ID NO: 12, 
from mouse), GDFl (SEQ ID NOs: 13, 30 and.31, from humans, see Lee (1991), PNAS 
5 88:425(M254), 60A (SEQ ID NOs: 23 and 24, from Drosophila, see Wharton et al. (1991) 
mM 88:9214-9218), dorsalin-1 (from chick, see Easier et al. (1993) Cdl 73:687-702 and 
GenBank accession mimber L12032) and GDF5 (from mouse, see Stoim et al. (1994) Nature 
368:639-643). Additional usefiil morphogens inchide biosynthetic moiphogen constructs 
disclosed in U.S. Pat. No. 5,01 1.691, e.g.. COPl, 3-5, 7 and 16, as well as others known in the 
10 art inchiding dor3, NODAL. UNIVIN, BMP9, BMPIO, GDF3, GDF6, GDF7, CDMP2, and 
SCREW. See also U.S. Pat No. 4,968,590, incorporated herein by reference. 

Naturally occurring proteins identified and/or appreciated herein to be morphogens form a 
distinct subgroup within the loose evolutionary grouping of sequence-related proteins known as 
the TGFP superfamily or supergene family. The naturally occurring morphogens share substantial 
amino acid sequence homology in tiieir C-terminal regions (domains). Typically, the above- 
mentioned naturally occurring morphogens are translated as a precursor, having an N-terminal 
signal peptide sequence, typically less tiian about 30 residues, followed by a "pro" domain that is 
cleaved to yield tiie mature C-terminal domain. The signal peptide is cleaved rapidly upon 
translation, at a cleavage site tiiat can be predicted in a given sequence using the metiiod of Von 
Hdjne (1986) Nuclw Apidg Research 14:4683-4691. The pro domain typically is about tiiree 
times huger tiian tiie fully processed mature C-terminal domain. Herein, the "pro" form of a 
morphogen refers to a morphogen comprising a folded pair of polypeptides each comprising the 
pro and matiire domains of a morphogen polypeptide. Typically, tiie pro form of a morphogen is 
more sohible than the mature form under physiological conditions. The pro form appears to be 
25 the primary form secreted from cultured mammalian host cells. 

Table 1, below, summarizes various naturally occurring morphogens identified to date, 
including tiieir nomenclature as used herein, their Sequence Listing references, and publication 
sources for the amino acid sequences for the foil length proteins not included in the Sequence 
Listing. Each of the generic terms set forth in Table 1 is intended and should be understood to 
embrace morphogenically active proteins expressed from nucleic acids encoding tiie identified 
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sequence mentioned below and set forth in the Sequence Listing, or a morphogoiically active 
fragment or precursor thereof, including functional equivalents such as naturally occuning and 
biosynthetic variants thereof Naturally occurring variants include allelic variant forms isolated 
from other individuals of a smgle biological species, and phylogenetic counterpart (spedes) 
variant forms (homologues) isolated from phylogenetically distinct biological species. The 
disclosures of pubUcations mentioned below are incorporated herein by reference. 

TABLE 1 

"OPl » Refers genericaUy to morphogenically active proteins expressed from nucleic adds 

encoding OPl proteins, inchiding at least the human OPl protein disclosed in SEQ 
ID NO: 4 ("hOPl"), and the mouse OPl protein disdosed in SEQ ID NO: 5 
("mOPl "). In each of human and mouse OPl proteins, the conserved seven 
cysteine skeleton is defined by residues 38 to 1 39. cDNA sequences and amino 
add sequences encoded therdn and corresponding to the full length proteins are 
provided in SEQ ID NOs: 15 and 16 (hOPl) and SEQ ID NOs: 17 and 18 (raOP- 
1 .) The mature proteins are defined by residues 293-43 1 (hOPl) and 292-430 
(mOPl). The "pro" regions of the proteins, deaved to yield the mature, 
morphogenically active proteins are defined essentially by residues 30-292 (hOPl) 
and residues 30-29 1 (mOP 1). 

"OP2" Refers genericaUy to morphogenically active proteins expressed from nuddc adds 

encoding the 0P2 proteins, including at least the human 0P2 protdn disdosed in 
SEQ ID NO: 6 ("h0P2"), and the mouse 0P2 protdn disdosed in SEQ ID NO: 7 
("mOP2"). In each of human and mouse 0P2 protdns, the conserved seven 
cysteine skdeton is defined by residues 38 to 139 of SEQ ID NOs: 6 and 7. cDNA 
sequences and amino acid sequences encoded therein and correspondbg to the fiill 
length protdns are provided in SEQ ID NOs: 19 and 20 (hOP2) and SEQ ID NOs: 
21 and 22 (mOP2.) The mature protdns are d^ned essentially by residues 264- 
402 (hOP2) and 261-399 (mOP2). The "pro" regions of the proteins, cleaved to 
jndd the mature, morphogenically active proteins are defined essentially by 
residues 18-263 (hOP2) and residues 18-260 (mOPl). 
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"0P3" Refers generically to morphogenically active proteins expressed from nucleic adds 

encoding OPS proteins, including at least the mouse 0P3 protein disclosed in SEQ 
ID NO: 26 ("mOP3"). The conserved seven cysteine domain is defined by residues 
298 to 399 of SEQ ID NO: 26, which shares greater than 79% amino add identity 
5 with the corresponding mOP2 and hOP2 sequences, and greater than 66% identity 

with the correspondmg OPl sequences. A cDNA sequence encoding the above- 
mentioned amino add sequence is provided in SEQ ID NO: 25. 0P3 is unique 
among the morphogens idraitified to date in that the residue at poation 9 in the 
conserved seven cystdne domain (e.g., residue 315 of SEQ ID NO: 26) is a serine, 
*® whereas other morphogens typically have a tryptophan at this location. 

"CBMP2" Refers generically to morphogenically active proteins expressed from nucleic adds 
encoding the CBMP2 protdns, inchiding at least the human CBMP2A protdn 
disclosed in SEQ ID NO: 8 (hCBMP2A) and the human CBMP2B protdn 
disclosed in SEQ ID NO: 9 (hCBMP2B)^ The amino add sequence for the full 

length protdns, referred to in the literature as BMP2A and BMP2B, or BMP2 and 
BMP4, appear in Wozney, et al. (1988) Science 242:1528-1534. The pro domain 
for BMP2 (BMP2A) likely includes residues 25-248 of the published sequence; the 
mature protein, residues 249-396. The pro domain for BMP4 (BMP2B) likely 
includes residues 25-256 of the published sequence; the mature protein, 
20 residues 257-408. 

"DPP" Refers generically to proteins encoded by the aiMo^Ia DPP gene and defining at 
least the consaved seven cysteine skeleton of SEQ ID NO: 1 0. The amino add 
sequence for the full length protein appears in Padgett, et al. (1987) Nature 
325:81-84. The pro domain likely extends from the signal peptide cleavajge site to 
25 residue 456 ofthe published sequence; the mature protein likely is defined by 

residues 457-588. 

"Vgl" Refers generically to protdns encoded by the Xenopus Vgl gene and defining at 

least the conserved seven cysteine skeleton of SEQ ID NO: 1 1 . The amino acid 
sequence for the fiiU length protein appears in Weeks (1 987) Cdl 5 1 :861-867. The 
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10 



15 "60A" 



20 



"BMP3" 



25 



prodomain likely extends from the signal peptide cleavage site to residue 246 of 
the published sequence; the mature protein likely is defined by residues 247-360. 

Refers generically to protdns encoded by the murine Vgrl gene and defining at 
least the conserved seven cysteine skeleton of SEQ ID NO; 12. The amino acid 
sequence for the fiill length protein appears in Lyons, et al. (1989) PNAS 86:4554- 
4558. The prodomain likely extends fi-om the signal peptide cleavage site to 
residue 299 of the published sequence; the mature protein likely is defined by 
residues 300-438. 

Refers generically to protdns encoded by the human GDFl gene and defining at 
least the conserved seven cysteine skeleton of SEQ ID NO: 13. The cDNA and 
encoded amino sequence for the fiill length protein are provided in SEQ 
ID NOs: 30 and 3 1 . The prodomain likely extends from the signal peptide 
cleavage site to residue 214; the mature protein likely is defmed by residues 215- 
372. 

Refers generically to morphogenically active proteins expressed from nucleic acid 
encoding 60A proteins or morphogenically active fragments thereof, including at 
least the Drosophila 60A protein disclosed in SEQ ID NO: 24. A Drosophila 60A 
cDNA is disclosed in SEQ ID NO: 23. The prodomain likely extends from the 
signal peptide cleavage site to residue 324; the mature protein likely is defined by 
residues 325-455. The active fragment of 60A protein likely is defined by the 
conserved seven cysteine skeleton of residues 354 to 455 of SEQ ID NO: 24. The 
60A protein is considered likely herein to be a phylogenetic counterpart variant of 
the human and mouse OPl genes; Sampath, et al. (1993) PNAS 90:6004-6008. 

Refers generically to proteins encoded by the human BMP3 gene and defining at 
least the conserved seven cysteine skeleton of SEQ ID NO: 27. The amino acid 
sequence for the fiill length protein appears in Wozney, et al. (1988) Science 
242: 1 528- 1 534. The pro domain likely extends from the signal peptide cleavage 
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"BMP5" 



"BMP6'' 



site to residue 290 of the published sequence; the mature protein likely is defined 
by residues 291-472. 

Refers generically to proteins encoded by the human BMP5 gene and defining at 
least the conserved seven cysteine skeleton of SEQ ID NO: 28. The amino acid 
sequence for the fiill length protein appears in Celeste, et al. ( 1 99 1 ) PNAS 
87:9843-9847. The pro domain likely extends from the signal peptide cleavage site 
to residue 3 1 6 of the published sequence; the mature protein likely is defined by 
residues 317-454. 

Refers generically to protdns encoded by the human BMP6 gene and defining at 
least the conserved seven cysteine skeleton of SEQ ID NO: 29. The amino acid 
sequence for the fill! length protein appears in Celeste, et al. (1990) PNAS 
87:9843-5847. The pro domain likely extends fl-om the signal peptide cleavage site 
to residue 374 of the published sequence; the mature protein likely is defined by 
residues 375-513. 



As shown in Figure 1, the 0P2 and 0P3 proteins have an additional cysteine residue in the 
conserved C-terminal region (e.g., see residue 41 of SEQ ID NOs: 6 and 7), in addition to the 
consoved cystdne skdeton or domain in common with the other known proteins in this femily. 
The GDFl protein has a four amino add insert within the conserved skdeton (residues 44-47 of 
SEQ ID NO: 13) but this insert likdy does not interfere with the rdationship of the cysteines in 
20 the folded structure. Further, the CBMP2 protdns are missing one amino acid residue within the 
cystdne skdeton. Thus, these morphogen polypeptides illustrate the prindples of alignment used 
herdn with respect to the preferred reference morphogen sequence of human OPl, residues 38- 
139ofSEQIDNO:4. 

In certain preferred embodiments, morphogens usefiil herdn indude those in which the 
amino acid sequences of morphogen polypeptides comprise a sequence sharing at least 70% 
amino acid sequence homology or "sinularity", and preferably 80% homology or similarity with a 
reference morphogen sequence sdected firom the foregoing sequences or naturally occurring 
morphogens. Preferably, the referoice morphogen is human OPl, and the reference sequence 
thereof is the C-terminal seven cystdne domain present in morphogenically active forms of human 



wo 98/27995 



FCT/US97/23611 



-25- 

OPl, residues 38-139 of SEQ ID NO: 4. Morphogens useful herein accordingly include alleles, 
phylogenetic counterparts and other variants of the preferred reference sequence, whether 
naturally-occurring or biosynthetically produced (e.g., inckiding "muteins" or "mutant proteins"), 
as well as noyel members of the morphogenic family of proteins including the morphogens set 
5 forth and identified above, e.g., in connection with Table 1 . Certain particularly preferred 
morphogen polypeptides share at least 60% amino acid identity with the preferred reference 
sequence of human OPl, still more preferably at least 65% amino acid identity therewith. 

In other preferred embodiments, the family of morphogen polypeptides useful in the 
present invention, and members thereof, are defined by a generic amino acid sequence. For 

10 example. Generic Sequence 7 (SEQ ID NO: 1) and Generic Sequence 8 (SEQ ID NO: 2) 

disclosed below, acconmiodate the homologies shared among preferred morphogen protein family 
members identified to date, including at least OPl, OP2, OP3, CBMP2A, CBMP2B, BMP3, 
BMP5, BMP6, DPP, Vgl, Vgrl, 60A, and GDFl. The amino acid sequences for these proteins 
are described herein (see Sequence Listing) and/or in the art, as summarized above. The generic . 

15 sequences include both the amino acid identity shared by these sequences in the C-terminal 
domain, defined by the six and seven cysteine skeletons (Generic Sequences 7 and 8, 
respectively), as well as alternative residues for the variable positions withm the sequence. The 
generic sequences provide an appropriate cysteine skeleton where inter- or intramolecular 
disulfide bonds can form, and contain certain critical amino acids likely to influence the tertiary 

20 structure of the folded protems. In addition, the generic sequences allow for an additional 
cysteine at position 41 (Generic Sequence 7) or position 46 (Generic Sequence 8), thereby 
encompassing the morphogenically active sequences of 0P2 and 0P3 . 

Generic Sequence 7 fSEO ID NO: H 
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Xaa 


Xaa 


Xaa 


Phe 


Xaa 


Xaa 
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Xaa 


Gly 


Trp 
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Xaa 
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Xaa 


Xaa 
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15 


Xaa 
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Xaa 


Xaa 


Xaa 
20 
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Ala 
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Xaa 
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Xaa 
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Xaa 
50 
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Xaa 
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85 


Xaa 


Xaa 


Xaa 


Met 
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Val 


Xaa 


Xaa 


Cys 


Xaa 


Cys 


Xaa 



90 95 



Wherein each Xaa independently is selected from a group of one or more specified amino acids 
defined as follows: "res." means "residue" and Xaa at res. 2 = (Tyr or Lys); Xaa at res. 3 = Val or 
He); Xaa at res. 4 = (Ser, Asp or Glu); Xaa at res. 6 = (Arg, Gin, Ser, Lys or Ala); Xaa at res. 7 = 
5 (Asp or Glu); Xaa at res. 8 = (Leu, VaJ or He); Xaa at res. 1 1 = (Gin, Leu, Asp, His; Asn or Ser); 
Xaa at res. 12 = (Asp, Arg, Asn or Glu); Xaa at res. 13 = (Trp or Ser); Xaa at res. 14 = (lie or 
Val); Xaa at res. 1 5 = (lie or Val); Xaa at res. 1 6 (Ala or Ser); Xaa at res. 1 8 = (Glu, Gin, Leu, 
Lys, Pro or Arg); Xaa at res. 19 = (Gly or Ser); Xaa at res. 20 = (Tyr or Phe); Xaa at res. 21 = 
(Ala, Ser, Asp, Met, His, Gin, Leu or Gly); Xaa at res. 23 = (Tyr, Asn or Phe); Xaa at res. 26 = 

10 (Glu, His, Tyr, Asp, Gin, Ala or Ser); Xaa at res. 28 = (Glu. Lys, Asp, Gin or Ala); Xaa at res. 30 
= (Ala, Ser, Pro, Gin, He or Asn); Xaa at res. 3 1 = (Phe, Leu or Tyr); Xaa at res. 33 = (Leu, Val 
or Met); Xaa at res. 34 = (Asn, Asp, Ala, Thr or Pro); Xaa at res. 35 = (Ser, Asp, Glu, Leu, Ala 
or Lys); Xaa at res. 36 = (Tyr, Cys. His. Ser or De); Xaa at res. 37 = (Met. Phe, Gly or Leu); Xaa 
at res. 38 = (Asn, Ser or Lys); Xaa at res. 39 = (Ala. Ser. Gly or Pro); Xaa at res. 40 = (Thr, Leu 

15 or Ser); Xaa at res. 44 = (De, Val or Thr); Xaa at res. 45 = (Val, Leu, Met or He); Xaa at 

res. 46 = (Gin or Arg); Xaa at res. 47 = (Thr, Ala or Ser); Xaa at res. 48 = (Leu or He); Xaa at 
res. 49 = (Val or Met); Xaa at res. 50 = (His, Asn or Arg); Xaa at res. 51 = (Phe, Leu, Asn, Ser, 
Ala or Val); Xaa at res. 52 = (He, Met, Asn, Ala, Val, Gly or Leu); Xaa at res. 53 = (Asn. Lys, 
Ala, Ghi, Gly or Phe); Xaa at res. 54 = (Pro, Ser or Val); Xaa at res. 55 = (Glu, Asp, Asn, Gly, 

20 Val. Pro or Lys); Xaa at res. 56 = (Thr, Ala. Val, Lys, Asp, Tyr, Ser, Gly, He or His); Xaa at 

res. 57 = (Val, Ala or He); Xaa at res. 58 = (Pro or Asp); Xaa at res. 59 = (Lys, Leu or Glu); Xaa 
at res. 60 = (Pro, Val or Ala); Xaa at res. 63 = (Ala or Val); Xaa at res. 65 = (Thr, Ala or Glu); 
Xaa at res. 66 = (Gin, Lys, Arg or Glu); Xaa at res. 67 = (Leu, Met or Val); Xaa at res. 68 = 
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(Asn, Ser, Asp or Gly); Xaa at res. 69 = (Ala, Pro or Ser); Xaa at res. 70 = (He, Thr, Val or Leu); 
Xaa at res. 71 = (Ser, Ala or Pro); Xaa at res. 72 = (Val. Leu, Met or lie); Xaa at res. 74 = (Tyr 
or Phe); Xaa at res. 75 = (Phe, Tyr, Leu or His); Xaa at res. 76 = (Asp, Asn or Leu); Xaa at 
res. 77 = (Asp, Glu, Asn, Arg or Ser); Xaa at res. 78 = (Ser, Gin, Asn. Tyr or Asp); Xaa at 
res. 79 = (Ser, Asn, Asp, Gha or Lys); Xaa at res. 80 = (Asn, Thr or Lys); Xaa at res. 82 = aie. 
Val or Asn); Xaa at res. 84 = (Lys or Arg); Xaa at res. 85 = (Lys, Asn, Gin. His. Arg or Val); Xaa 
at res. 86 = (Tyr, Glu or His); Xaa at res. 87 = (Arg, Gin, Ghi or Pro); Xaa at res. 88 = (Asn, Glu, 
Trp or Asp); Xaa at res. 90 = (Val, Thr, Ala or He); Xaa at res. 92 = (Arg, Lys, Val, Asp, Gin or 
Ghi); Xaa at res. 93 = (Ala, Gly, Glu or Ser); Xaa at res. 95 = (Gly or Ala) and Xaa at res. 97 = 
(HQs or Arg). 

Generic Sequence 8 (SEQ ID NO: 2) includes all of Generic Sequence 7 and in addition 
inchides the foUowing sequence (SEQ ID NO: 14) at its N-terminus: 

Cys Xaa Xaa Xaa Xaa 
1 5 

Accordingly, beginning with residue 7, each "Xaa" in Generic Sequence 8 is a specified amino 
acid defined as for Generic Sequence 7, with the distinction that each residue number described 
for Generic Sequence 7 is shifted by five in Generic Sequence 8. Thus, "Xaa at res. 2 =(Tyr or 
Lys)" in Generic Sequence 7 refers to Xaa at res. 7 in Generic Sequence 8. In Generic Sequence 
8, Xaa at res. 2 = (Lys, Arg, Ala or Gh); Xaa at res. 3 = (Lys, Arg or Met); Xaa at res. 4 = (His, 
Arg or Gin); and Xaa at res. 5 = (Glu, Ser, His, Gly, Arg, Pro, Thr, or Tyr). 

As noted above, certain currently preferred morphogen polypeptide sequences usefiil in 
this invention have greater than 60% identity, preferably greater than 65% identity, with the 
amino acid sequence defining the conserved six or seven cysteine skeleton of hOPl (e.g., residues 
43-139 or 38-139 of SEQ ID NO: 4). These particularly preferred sequences include aUeUc and 
phylogenetic counterpart variants of the OPl and 0P2 proteins, including the Drosophila 60A 
protein (SEQ ID NO: 24). Accordingly, in certain particularly preferred embodiments, useful 
morphogens inchide active proteins comprising pairs of polypeptide chains within the generic 
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amino acid sequence herein referred to as "OPX" (SEQ E) NO: 3), which corresponds to the 
seven cysteine skeleton and accommodates the homologies between several identified variants of 
OPl and 0P2. As described therein, each Xaa at a given position independentiy is selected from 
the residues occurring at the corresponding position in the C-terminal sequence of mouse or 
human OPl or 0P2 (see SEQ ID NOs: 4-7 and/or SEQ ID NOs: 15-22). 

In still other preferred «anbodiments, useful morphogwi polypeptides have amino acid 
sequences comprising a sequence encoded by a nucleic acid that hybridizes, under stringent 
hybridization conditions, to DNA or RNA encoding reference morphogen sequences, e.g., C- 
taminal sequences defining the conserved sevai cystdne domains of OPl or 0P2, e.g., 
nucleotides 1036-1341 and nucleotides 1390-1695 of SEQ ID NO: 15 and 19, respectively. As 
used herein, stringent hybridization conditions are defined as hybridization according to known 
techniques in 40% formamide, 5 X SSPE, 5 X Denhardt's Solution, and 0. 1% SDS at 37°C 
overnight, and washing in 0. 1 X SSPE, 0. 1% SDS at 50°C. 

As noted above, morphogens usefiil in the present invention generally are dimeric proteins 
comprising a folded pair of the above polypeptides. Morphogens are inactive when reduced, but 
are active as oxidized horaodimers and when oxidized in combination with other morphogens of 
this invention to produce heterodimers. Thus, members of a folded pair of morphogen 
polypeptides in a morphogenicaUy active protein can be selected independentiy from any of tiie 
specific morphogen polypeptides mentioned above. 

The morphogois usefiil m the metiiods, compositions and devices of this mvention inchide 
protems comprising any of tiie polypeptide chains described above, whetiier isolated fit)m 
naturaUy-occurring sources, or produced by recombinant DNA or otiier synUietic techniques, and 
includes aUelic and phylogenetic counterpart variants of tiiese protrans, as well as biosynthetic 
variants (mutdns) thereof, and various truncated and fiision constiucts. Deletion or addition 
25 mutants also are enviaoned to be active, including tiiose which may alter the conserved C- 

termmal six or seven cysteine doman, provided tiiat the alteration does not fiinctionally disrupt 
the relationship of these cysteines in tiie folded, biologicaUy active, structure. Accordingly, such 
active forms are considered the equivalent of the specifically described constructs disclosed 
herein. The proteins may inchide forms having varying glycosylation patterns, varying N-termini, 
a family of related proteins having regions of amino acid sequence homolosr, and active truncated 
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30 



wo 98/27995 



PCT/US97/23611 



- 29 - 

or mutated forms of native or biosynthetic proteins, produced by expression of recombinant DNA 
in. host cells. 

The morphogenic proteins can be expressed from intact or truncated cDNA or from 
synthetic DNAs in prokaryotic or wikaiyotic host cells, and purified, cleaved, refolded, and 
dimerized to form morphogenically active compositions. Currently preferred host cells inchide K 
M or mammalian cells, such as CHO, COS or BSC cells. A detailed description of the 
morphogens usefiil in the methods, compositions and devices of this invention is disclosed in 
pubhshed application W092/15323, the disclosure of which is incorporated by reference herein. 

Thus, in view of this disclosure, skiDed genetic en^neers can isolate genes from cDNA or 
genomic Ubraries of various different biological species, which encode appropriate amino acid 
sequences, or construct DNAs from oligonucleotides, and then can express them in various types 
of host cells, including both prokaryotes and eukaryotes, to produce large quantities of active 
proteins capable of stimulating the morphogenesis of, and/or inhibiting damage or loss of, 
mammalian myocardial tissue. 

As noted above, a protein is morphogenic herein generally if it induces the developmental 
cascade of cellular and molecular events that culminate in the formation of new, organ-specific 
tissue. Preferably, a morphogen comprises a pair of polypeptides having a sequence that 
corresponds to or is fimctionally equivalent to at least the conserved C-terminal six or seven 
cysteine skeleton of human OPl, inchided in SEQ ID NO: 4. The morphogens generally are 
competent to induce a cascade of events inchiding all of the foUowing, in a morphogenically 
pomissive environment: stimulating prolifaration of progenitor cells; stimulating the 
differentiation of progoiitor cells; stimulating the proliferation of differentiated cells; and 
supportitig the growth and maintenance of differentiated cells. Details of how the morphogens 
usefiil in this invention first were idaitified, as weU as a description on how to make, use and test 
them for morphogenic activity are disclosed in published application W092/15323. As disclosed 
therein, the morphogens can be purified from naturally-sourced material or recombinantly 
produced from prokaryotic or eukaiyotic host cells, using the genetic sequences disclosed therein. 
Alternatively, novel morphogenic sequences can be identified following the procedures disclosed 
therein. 
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Exemplary useful morphogens include naturaUy derived proteins comprising a pair of 
polypeptides, the amino acid sequences of which comprise sequences selected from those 
disclosed in the Sequence Listing and Figure 1 . Other useful sequences include those of the 
naturaUy derived morphogens dorsalin-1, SCREW, NODAL, UNIVIN and GDF5, discussed 
5 herein in connection with Table 1, as well as biosynthetic constructs disclosed in U.S. Pat. 

5,01 1,691, the disclosure of which is incorporated herein by reference (e.g., COPl, C0P3, C0P4, 
COPS, COP7, and C0P16). 

Accordingly, certain preferred morphogens usefid in the methods and compositions of this 
invention can be described as morphogenicaUy active proteins having amino add sequences 

10 sharing 70% or, preferably, 80% homology witii a reference morphogen sequence described 
above. e.g., residues 38-139 of SEQ ID NO: 4, where "homology" is as defined herein above. 
Alternatively, in other preferred embodiments, morphogens useful in the methods and 
con^ations disclosed herein fall within the family of polypeptides described by Generic 
Sequence 7, SEQ ID NO: 1 , more preferably by Generic Sequence 8, SEQ ID NO: 2. 

15 Figure 1 herein sets forth an alignment of the amino acid sequences of the active regions 

of exemplary naturally occurring proteins that have been identified or appreciated herein as 
morphogens, including human OPl (hOPl, SEQ ID NOs: 4 and 15-16), mouse OPl (mOPl, 
SEQ ID NOs: 5 and 1 7- 1 8), human and mouse 0P2 (SEQ ID NOs: 6, 7, and 19-22), mouse OP3 
(SEQ ID NOs: 25-26), CBMP2A (SEQ ID NO: 8), CBMP2B (SEQ ID NO: 9), BMP3 (SEQ ID 

20 NO: 27), DPP (from Drosophila, SEQ ID NO: 1 0), Vgl, (from Xenopus. SEQ ID NO: 1 1), Vgr 1 
(from mouse, SEQ ID NO: 12), GDFl (from mouse and/or human, SEQ ID NOs: 13, 30 and 3 1), 
60A protein (from Drosophila, SEQ ID NOs: 23 and 24), BMP5 (SEQ ID NO: 28) and BMP6 
(SEQ ID NO: 29). The sequences are aligned essentially following tiie metiiod of Needleman, et 
al. (1970) J, Mol, Pipl,. 48:443-453, calcukted using the Align Program (DNAstar, Inc.). In 

25 Figure 1 , three dots indicates that tiie amino acid in that position is tiie same as the corresponding 
amino acid in hOPl . Three dashes indicates tiiat no amino acid is present in that position, and are 
inchided for purposes of iUustrating homologies. For example, amino acid residue 60 is "missing" 
in botii CBMP2A and CBMP2B. Of course, botii of tiiese amino acid sequences in this region 
comprise Asn-Ser (residues 58, 59), with CBMP2A tiien comprising Lys and He, whereas CBMP- 

30 2B comprises Ser and He. Figure 1 also iUustrates the handling of insertions in tiie morphogoi 
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amino acid sequence: between residues 56 and 57 of BMP3 is an inserted Val residue; between 
residues 43 and 44 of GDFl is inserted the amino acid sequence, Gly-Gly-Pro-Pro. Such 
deviations from the reference morphogen sequence are ignored for purposes of calculating the 
defined relationship between, e.g., GDFl and hOPl. As is apparent from the amino acid 
sequence comparisons set forth in Figure 1. significant amino acid changes can be made from the 
reference sequence whUe retaining morphogenic activity. For example, while the GDFl protein 
sequence depicted in Figure 1 shares only about 50% amino acid identity with the hOPl sequence 
described therein, the GDFl sequence shares greater than 70% amino acid sequence homology 
with the hOPl sequence, where "homology" is as defined above. 

In other embodiments, as an alternative to the administration of a morphogenic protein, an 
eflFective amount of an agent competent to stimulate or induce increased endogenous moiphogen 
expression in a mammal may be administered by any of the routes described herein. Such an 
inducer of a morphogen may be provided to a mammal, e.g.. by local or systemic administration 
to the mammal or by direct administration to implanted myogenic precursor cells, or may be 
provided to auxiliary cells co-cultured with myogenic precursor cells. Methods for identifying 
and testing inducers (stimulating agents) competent to modulate the level of production of 
morphogens by a given tissue or cell type are described in detail in published applications 
WO93/05172 and WO93/05751, the teachings of which are incon,orated herein by reference. 
Briefly, candidate compounds can be identified and tested by incubation in vitro with a test tissue 
or ceUs thereof, or a cultured cell line derived tiierefrora, for a time suflBcient to allow the 
compound to affect the production, i.e., tiie expression and/or secretion, of a morphogen 
produced by the ceUs of that tissue. Suitable tissue, or cultured cells of a suitable tissue, 
preferably can be selected from renal epithefium. ovarian tissue, fibroblasts, and osteoblasts. 

In oti»er embodiments, an agent which acts as an agonist of a morphogen receptor may be 
administered instead of tiie morphogen itself. Such an agent may also be referred to an a 
morphogen ••mimic," "mimetic," or "analog." Thus, for example, a small peptide or other 
molecule which can mimic tiie activity of a morphogen in binding to and activating the 
morphogen's receptor may be employed as an equivalent of the morphogen. Preferably the 
agonist is a fiill agonist, but partial morphogen receptor agonists may also be advantageously 
employed. Metiiods of identifying such agonists are known in tiie art and inchide assays for 
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compounds which induce morphogen-mediated responses (e.g.. induction of differentiation of " 
metanephric mesenchyme, induction of endochondnd bone formation). For example, methods of 
identifying morphogen bducers or agonists of morphogen receptors may be found in US. Ser. 
No. 08/478,097 filed June 7. 1995 and U.S. Ser. No. 08/507.598 filed July 26, 1995, the 
disclosures of which are incorporated herein by reference. 

In yet other embodiments, a smaU molecule morphogenic activator may be used for 
promoting the migration, proliferation, and/or differentiation of myogenic precursor cells by 
increasmg tiie level of expression of proteins associated with myocardial phenotype. Exemplary 
methods comprise introducing a small molecule morphogenic activator tiiat regulates some 
portion or portions of a moiphogen-induced regulatory patiiway. resulting in an effective increase 
in expression or activity of myocardhim-specific protein. This may resuh either from stimulating 
an increase in tiie endogenous expression of such protein or from a decrease in the inhibition of" 
normal expression of such protein. For example, a small molecule morphogenic activator may act 
at tiie type I or type II morphogen receptor; or at the serine/threonine kinase, or oUier kinase 
domains of tiiose receptors. Another target of pathway activation is the Smad proteins, including 
tiie monomeric, dimeric (including heteromeric and homomeric complexes) or trimeric forms 
(including heteromeric and homomeric complexes). The Smads have been characterized, and are 
known in the art See, e.g., Baker, et al., Curr. On GenRt n^olnp r. 467-473 (1997), 
incorporated by reference h^ein. 

Alternately, a small molecule morphogenic activator may lead to activation of a 
transcription factor (for example, the X-protein shown in Figure 2) tiiat causes phenotype-specific 
gene expression (i.e., expression of protein characteristic of myocardium). A smaU molecule 
morphogenic activator may act to fiidlitate. mimic, or. if desired, prevent any one or several of 
tiie following: type I and/or type n receptor binding, phosphorylation of tfie type I receptor, 
phosphorylation of tiie Smad molecules, Smad complex formation, Smad translocation into the 
nucleus, nuclear accumulation of tiie Smad complex, or transcription modulation of tiie Smad 
complex. Furthermore, a small molecule morphogenic activator may act on Smads or Smad 
complexes to alter tertiary structure, thereby to facilitate or inhibit interaction of the Smad or 
Smad complex with a receptor kinase domain, otiier Smads, DNA binding proteins, or DNA 



itsdf 



W098/27»5 



PCT/US97/23611 



-33 



In a particularly-preferred embodiment, a small molecule morphogenic activator is 
contacted with myogenic precursor cells in vivo or in vitro, or is administered to a patient, 
wherem the small molecule morphogenic activator facilitates formation of Smad complexes, 
particularly complexes comprising molecules of Smadl, Smad2, SmadS, Smad4, SmadS and/or 
5 Smad8 in order to induce myogenic precursor cells to migrate, proliferate and/or diflferentiate bto 
cells expressing markers of a myocardial tissue phenotype. Also in a preferred embodiment, 
methods comprise administering a smaU molecule morphogenic activator composition that 
activates a serineAhreonine kinase domain associated with a morphogen type I or type H receptor, 
thereby to activate the pathway involved in morphogen-induced gene expression. In another 

10 embodiment, methods of the invention comprise activating Sniad4 association with Smadl , 

thereby to induce morphogen-responsive phenotype. Methods of the invention may also facilitate 
Smad mteraction with specific nucleic adds, such as promoters of myocardial tissue phenotype- 
specific gene expression (i.e., expression of genes for a phenotypic protein; a protein associated 
with preservation, restoration, or enhancement of phenotype, including a protein which is critical 

15 for production of non-protein phenotypic markers, such as characteristic lipids or carbohydrates; a 
protein associated with performance of a phenotypic function or morphology; or a morphogen). 
Such interaction may be, for example, in association with a transcription control factor tiiat is 
capable of binding to a regulatory portion of a gene and, simultaneously, to one or more 
regulatory proteins such as a Smad complex (See Figure 2). 

20 An exemplary morphogen-activated pathway is shown in Figure 2. Morphogens are 

Ugands for the type I and type H receptors. Following phosphorylation of tiie type I receptor by 
the type=-n receptor, the type I receptor specificaUy phosphorylates Smadl homodimers. The type 
I receptor also specifically phosphorylates SmadS homodimers. The homodimers tiien separate to 
form, in association with a phosphorylated Smad4 molecule, a phosphorylated heteromeric 

25 complex comprising at least a Smadl and a Smad4. A phosphorylated Smadl/Smad5/Smad4 
hetra-otrimer may alternatively be formed. The heteromeric complex then translocates into the 
nucleus, and accumulates therein. In die nucleus, the Smad complex binds operative DNA, either 
alone or in association with a specific DNA binding protein (the X-protein in Figure 2), to initiate 
DNA transcription. The "X-protein" acts as a DNA-binding protein, binding the Smad 

30 heteromeric complex to the DNA. The Smadl, Smad2. Smad3 and SmadS proteins consist of 
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conserved amino- and carboxy-terminal domains linked by a region that is more divergent among 
the Smads. The carboxy-terminal domain has an eflfector function. The amino-terminal domain 
interacts physically with the carboxy-termmal domain, inhibiting its effector activity, and 
contributes to DNA binding. Receptor-mediated phosphorylation of the serine residues at the end 
5 of the carbon-terminal domain relieves the carboxy-terminal domain from the inhibitory action of 
the amino-terminal donuiin. Phosphoiylated Smad molecules form a heteromeric complex with at 
least one other specific Smad family molecule. The resulting Smad complex then translocates into 
and accumulates m the cell nucleus. There, the heteromeric Smad complexes regulate 
transcriptional responses either alone or by specific interaction with a DNA-binding protein, such 

10 as forkhead activin signal transducer-1 (FASTI). 

Other intracelhilar pathways are induced by morphogens, and may be affected in the 
manner described herein by use of a small molecule morphogenic activator. 

In a preferred embodiment, a small molecule morphogenic activator for use in the 
invention is a compound that affects one or more intracellular pathways that normally are under 

15 morphogen regulation. Such small molecule morphogenic activators preferably have the ability to 
enter the cell and target one or more intracellular pathway components in order to stimulate or 
inhibit their activity. For example, a small molecule morphogenic activator that promotes Smad 
complex formation between Smadl, Smad4, and SmadS will stimulate pathways leading to 
expression of genes encoding phenotype-specific proteins. 

20 One way in which to identify a candidate small molecule morphogenic activator is to assay 

for the ability of the candidate to modulate the effective systemic or local concentration of a 
morphogen. This may be done, for example, by incubating the candidate in a cell culture that 
produces the morphogen, and assaying the culture for a parameter indicative of a change in the 
production level of tiie morphogen according the methods of U.S.S.N. 08/451,953 and/or U.S. 

25 5,650,276, the teachings of each of which are incorporated by reference herein. Alternatively, 
candidate compounds are screened for their ability to induce phenotype-specific protein 
production in a cell culture in which morphogen activity is not present. Examples of 
compositions which may be screened for their effect on the production of morphogens or other 
phenotype-specific proteins include but are not Umited to chemicals, biological response modifiers 

30 (e.g., lymphokines, cytokines, hormones, or vitamins), plant extracts, microbial broths and 
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extracts medium conditioned by eukaryotic cells, body fluids, or tissue extracts. Useful candidate 
compositions then may be tested for in vivo efficacy in a suitable animal model These 
compositions then may be used in vivo to up-regulate morphogen-activated regulatory pathways 
of phenotype-specific protein expression. 

A simple method of determining if a small molecule composition has effected a change in 
the level of a phenotype-specific protein in cultured cells is provided in co-owned, co-pending 
patent appUcation, U.S.S.N. 08/451,953, the disclosure of which is incorporated by reference 
herein. The level of a target pheno^e-specific protein in a cell resulting from exposure to a 
small molecule is measured. Alternatively, a change in the activity or amount of an intracelhilar 
pathway component is measured in response to application of a candidate small molecule. 
Candidates having the desired aflFect on protein production or pathway regulation are selected for 
use in methods of the invention. If, for example, a composition up-regulates the production of 
OP-1 by a kidney cell line, it would then be desirable to test systemic administration of this 
compound in an animal model to determine if it up-regulates the production of OP-1 in vivo . 
The level of morphogen in the body may be a result of a wide range of physical conditions, e.g., 
tissue degeneration such as occurs in diseases including arthritis, emphysema, osteoporosis, 
kidney diseases, lung diseases, cardiomyopathy, and cirrhosis of the liver. The decrease in level of 
morphogens in the body may also occur as a result of the normal process of aging. The same 
strategy is used for compositions affecting intracelhilar pathway components. A composition 
selected by these screening methods is then used as a treatment or prophylactic. 

An appropriate test cell is any cell comprismg DNA defining a morphogen-responsive 
transcription activating element operatively associated with a reporter gene encoding a detectable 
phenotype-specific gene product. Such DNA can occur naturally m a test cell or can be a 
transfected DNA. The induced intracellular effect typically is characteristic of morphogenic 
biological activity, such as Smad activation, or activation of a cascade of biochemical events, such 
as described above, or involving, for example, cyclic nucleotides, diacylglycerol, and/or and other 
indicators of intracellular signal transduction such as activation or suppression of gene expression, 
including induction of mRNA resultmg from gene transcription and/or induction of protein 
synthesis resulting from translation of mRNA transcripts indicative of tissue morphogenesis. 
Exemplary morphogen-responsive cells are preferably of mammalian origin and include, but are 
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not limited to, osteogenic progenitor ceUs; calvaria-derived cells; osteoblasts; osteoclasts; 
osteosarcoma cells and cells of hepatic or neural origin. Any such morphogen responsive cell can 
be a suitable test cell for assessing whether a candidate substance is a smaU molecule moiphogenic 
activator. 

A preferred identification method is carried out by exposing a test cell to at least one 
candidate substance, and detecting whether such exposure induces expression of the detectable 
phenotype-specific gene product that is in operative association with the morphogen-responsive 
transcription activating element. Expression of this gene product indicates that the candidate 
substance induces a morphogen-mediated biological effect Skilled artisans can, in light of 
guidance provided herein, construct a test cell with a responsive element from a morphogen- 
responsive cell and a reporter gene of choice, using recombinant vectors and transfection 
techniques well-known in the art. There are numerous well-known reporter genes useful herein. 
These include, for example, chloramphenicol acetyltransferase (CAT), hiciferase, human growth 
hormone (hGH), beta-galactosidase, and assay systems and reagents which are available through 
commercial sources. As will be appreciated by skilled artisans, the listed reporter genes represent 
only a few of the possible reporter genes that can be used herein. Examples of such reporter 
genes can be found in Ausubel et al., Eds., Current Protocols in Molecular Biology, John Wiley & 
Sons, New York, (1989). Broadly, any gene that encodes a detectable product, e.g., any product 
having detectable enzymatic activity or against which a specific antibody can be raised, can be 
used as a reporter gene in the presait identification method. 

A currenUy preferred reportar gene system is the firefly ludferase reporter system. Gould, 
et al.. Anal. Bipchem., 7:404-408 (1988). incorporated herein by reference. The hiciferase assay 
is fest and sensitive. In this assay system, a lysate of the test cell is prepared and combined with 
ATP and the substrate hici^n. The encoded enzyme luciferase catalyzes a rapid, ATP- 
dependent oxidation of the substrate to generate a hght-ranitting product. The total light output is 
measured and is proportional to the amount of luciferase present over a wide range of enzyme 
concentrations. CAT is another frequently used reporter gene system; a Aajor advantage of this 
systan is that it has been an extensively validated and is widely accepted as a measure of 
promoter activity. Gorman , et al., Mol. Cell. Biol . 2:1044-1051 (1982), incorporated by 
reference herein. In this system, test cells are transfected with CAT expression vectors and 



wo 98/27995 



PCT/US97/23611 



-37- 

incubated with the candidate substance within 2-3 days of the initial transfection. Thereafter, cell 
extracts are prepared. The extracts are incubated with acetyl CoA and radioactive 
chloramphenicol. Following the incubation, acetylated chloramphenicol is separated from 
nonacetylated form by thin layer chromatography. In this assay, the degree of acetylation reflects 
the CAT gene activity with the particular promoter. 

Another suitable reporter gene system is based on immunologic detection of hCH. This 
system is also quick and easy to use. Sddwi, et al., Mol. Cell. Biol 6:3173-3179 (1986), 
incorporated by reference herein. The hGR system is advantageous in that the expressed hGH 
polypeptide is assayed in the media, rather than in a ceU extract. Thus, this system does not 
require the destruction of the test ceUs. It wUl be appreciated that the principle of this reporter 
gene system is not limited to hGH but rather adapted for use with any polypeptide for which an 
antibody of acceptable specificity is available or can be prepared. 

A small molecule morphogenic activator composition may up-regulate a morphogen- 
activated pathway by acting at any one or more point. For example, small molecule morphogenic 
activator potentiation of the pathway may be initiated at the receptor level. Depending on the 
pathway, the transmembrane receptors may be type I and/or type II, or may be comprise 
variations on either type I or type U receptors. For racample, OP-1 is capable of activating 
regulatory pathways comprising at least two variations of both type I and type n receptors (ActR- 
1 and BMPR-IB, and ActRn and BMPR-n, respectively). A small molecule morphogenic 
activator may stimulate the pathway by acting as a ligand and binding to any of the receptors, 
thereby inducing phosphorylation of type I receptors and/or Smad molecules. Similarly, a small 
molecule morphogenic activator may activate the regulatory pathway at the level of the 
serine/threonine kinase domain of the receptors, thereby stimulating phosphorylation of type I 
receptors and/or Smad molecules. 

As a &rther alternative, a small molecule morphogenic activator may activate the 
regulatory pathway at the level of Smad complex formation. A small molecule morphogenic 
activator may stimulate the formation of Smad femily homodimers, heterodimers, or other 
homomaic or heteromeric complexes. Furthermore, a small molecule morphogenic activator may 
activate the pathway by interacting with a Smad molecule or Smad complex, thereby altering its 
tertiary structure. 
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Alternatively, or in addition, a small molecule moiphogenic activator may activate the 
regulatory pathway by facilitating translocation of a Smad molecule or Smad complex or 
accumulation of the Smad molecule or Smad complex within the nucleus of the cell. By acting as 
a DNA binding protein or a transcriptional activator, a small molecule moiphogenic activator may 
activate the regulatory pathway by increasing transcriptional activity caused by the Smad 
molecule or Smad complex. 

Furthermore, a smaU molecule morphogenic activator can act to stimulate the regutetoiy 
pathway by interfering with an inhibitor of the pathway. For example. Smad6 and Smad7, which 
are structurally different than Smadl, Smad2, Smad3 and Smad5, act as inhibitors of certain types 
of desirable phenotype-specific protein expression (e.g., by activating TGF-p to induce scar tissue 
formation). Smad6 forms a stable association with type I receptors and interferes with the 
phosphorylation of other Smad proteins, including Smad2 and Smad 1, and their subsequent 
heteromerizationwithSmad4. Smad? also forms a stable association with activated type I 
receptors and blocks access and phosphorylation of certain Smad molecules, thereby preventing 
15 formation of certain Smad heteromeric complexes. Smad7 also inhibits nuclear accumulation of 
Smad heteromeric complexes. A small molecule morphogenic activator may interfa-e with the 
inhibitory activity of these Smad proteins by, for example, tightly binding to either one or both 
protems and rendering either protein incapable of stable association with type I receptors, or by 
competitively binding and stimulating the morphogen-activated transmembrane receptors. 
Alternatively, a smaU molecule morphogenic activator may activate the inhibitory effects of these 
Smads in order to inhibit an undearable effect (e.g., TGPP activity). 
E. Subiects for Treatintwit 

As a general matter, the methods of the present invention may be utilized for any 
mammalian subject at risk of, or afflicted with, loss of or damage to myocardium. Mammalian 

25 subjects which may be treated according to the methods of the invention include, but are not 
limited to, human subjects or patients. In addition, however, the invention may be employed in 
the treatment of domesticated mammals which are maintained as human companions (e.g., dogs, 
cats, horses), which have signijBcant commercial value (e.g., dairy cows, beef cattle, sporting 
animals), which have significant scientific value (e.g., captive or free specimens of endangered 

30 species), or which otiierwise have value. In addition, as a general matter, the subjects for 
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treatment with the methods of the present invention need not present indications for morphogen 
treatment other than those associated with loss of or damage to n^ocardium. That is, the 
subjects for treatment generally are expected to be otherwise free of indications for moiphogen 
treatment. In some number of cases, however, the subjects may present with other symptoms 
(e.g., osteoporosis, chronic renal failure) for which morphogen treatment also would be indicated. 
In such cases, the morphogen treatment should be adjusted accordingly to avoid excessive dosing. 

One of ordinary skill in the medical or veterinary arts is trained to recognize subjects at 
risk of, or afflicted with, loss of or damage to myocardium. In particular, clinical and non-clinical 
indications, as well as accumulated experience, relating to the presently disclosed and other 
methods of treatment, are expected to inform the skilled practitioner in deciding whether a given 
individual is a subject at risk of, or afflicted with, loss of or damage to myocardium and whether 
any particular treatment is best suited to the subject's needs, including treatment according to the 
present invention. 

As a general matter, a mammalian subject may be regarded as a subject at risk of, or 
aflOicted with, loss of or damage to myocardium if that subject has already been diagnosed as at 
risk of, or afflicted with, loss of or damage to myocardium. Such subjects include, but are not 
limited to, those which have already suffered a myocardial mfarction, which have suffered a 
physical trauma to the heart, or which have been diagnosed with congestive heart feihire. 
Formulatio ns and Methods of Treatment 

The morphogens, morphogen induces, agonists of morphogen receptors, or small 
molecule morphogenic activators of the present invention may be provided to myogenic precursor 
ceUs by any suitable means, preferably directly (e g., in vitro or locally after implantation, as by 
addition to culture medium, injection or topical administiation to a tissue locus) or systemically 
(e.g., parenteraUy or orally). Preferably, Uie morphogen, morphogen inducer, agonist of a 
morphojgen receptor, or small molecule morphogenic activator comprises part of an aqueous, 
physiologically acceptable solution so that in addition to delivery of tiie desired agent to the target 
cdls, the sohition does not otherwise adversely afiFect the cells' or subject's electrolyte and/or 
vohmie balance. The aqueous medium for the agent thus may comprise normal physiologic saline 
(e.g., 9.85% NaCl, 0.1 5M, pH 7-7.4). Such an aqueous solution containing the agent can be 
made, for example, by dissolving or dispersing the agent in 50% ethanol containing acetpnitrile in 
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0.1% trifluoroacetic acid (TFA) or 0.1% HCl, or equivalent solvents. One volume of the 
resultant solution then is added, for example, to ten volumes of phosphate buffered saline (PBS), 
which further may include 0, 1-0.2% human serum albumin (HSA). The resultant solution 
preferably is vortexed extensively. 

For systemic administration, the morphogens, morphogen inducers, agonists of 
morphogen receptors, or small molecule morphogenic activators of the present invention may be 
administered by any route which is compatible with the particular morphogen, mducer, or agonist 
employed. Where the agent is to be provided parenterally, such as by intravenous, subcutaneous, 
intramuscular, intraorbital, ophthalmic, intraventricular, intracranial, intracapsular, intraspinal, 
intracistemal, intraperitoneal, buccal, rectal, vaginal, intranasal or by aerosol administration, the 
agent preferably comprises part of an aqueous solution. In addition, administration may be by 
periodic injections of a bohis of the morphogen, inducer, agonist, or small molecule morphogenic 
activator, or may be made more continuous by intravenous or intraperitoneal administration from 
a reservoir which is external (e.g., an i.v. bag) or internal (e.g., a bioerodable implant, or a colony 
of implanted, morphogen-producing cells). 

If desired, a given morphogen or other agent may be made more soluble by association 
with a suitable molecule. For example, association of the mature morphogen dimer with the pro 
domain results in the pro form of the morphogen which typically is more soluble or dispersible in 
physiological solutions than the corresponding mature form. In fact, endogenous morphogens are 
thought to be transported (e.g., secreted and circulated) in the mammalian body in this form. This 
soluble form of the protein can be obtained from culture medium of morphogen-secreting 
mammalian cells, e.g., cells transfected with nucleic acid encodmg and competent to express the 
morphogen. Alternatively, a soluble species can be formulated by complexing the mature dimer 
(or an active fragment thereof) with a morphogen pro domain or a solubility-enhancing fragment 
thereof (described more fiilly below). Another molecule capable of enhancing solubility and 
particularly useful for oral administrations, is casein. For example, addition of 0.2% casein 
increases solubility of the mature active form of OPl by 80%. Other components found in milk 
and/or various serum proteins also may be useful. 

Useful solutions for parenteral administration may be prepared by any of the methods well 
known in the pharmaceutical art, described, for example, in Remington's Pharmaceutical Sciences 
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(Gennaro, A., ed.), Mack Pub., 1990. Fonnulations of the therapeutic agents of the invention 
may include, for example, polyalkylene jglycols such as polyethylene glycol, oils of vegetable 
origin, hydrogenated naphthalenes, and the like. Fonnulations for direct administration, in 
particular, may include glycerol and other compositions of high viscosity to help mamtain the 
5 agent at the desired locus. Biocompatible, preferably bioresorbable, polymers, inchiding, for 
example, hyaluronic acid, coUagen, tricalcium phosphate, polybutyrate, lactide, and glycoHde 
polymers and lactide/glycolide copolymers, may be useful exdpients to control the release of the 
agent inwro. Other potentially useful parenteral delivery systems for these agents include 
ethylene-vinyl acetate copolymer particles, osmotic pumps, implantable infusion systems, and 
10 Uposomes. Formulations for inhalation administration contain as excipients, for example, lactose, 
or may be aqueous solutions containing, for example, polyoxyethylene-9-Iauryl ether, 
glycocholate and deoxycholate, or oily solutions for administration in the form of nasal drops, or 
as a gel to be applied intranasally. Formulations for parenteral administration may also include 
glycocholate for buccal administration, methoxysalicylate for rectal administration, or cutric acid 
15 for vaginal administration. Suppositories for rectal administration also may be prepared by mixing 
the morphogen, inducer, agonist, or small molecule morphogenic activator with a non-irritating 
excipient such as cocoa butter or other compositions which are solid at room temperature and 
liquid at body temperatures. 

Formulations for local or topical administration to a tissue or skin surfece may be prepared 
by dispersing the morphogen, inducer, agonist or smaU molecule morphogenic activator with an 
acceptable carrier such as a lotion, cream, ointment or soap. Particulariy useful are carriers 
capable of forming a fihn or layer over the skin or tissue to localize application and inhibit 
removal. For local or topical administration to internal tissue surfaces, the agent may be dispersed 
in a Bquid tissue adhesive or other substance known to enhance adsorption to a tissue surface. 
25 For example, hydroxypropylcelhilose or fibrinogen/thrombin solutions may be used to advantage. 
Altranatively, tissue-coating solutions, such as pectin-containing formulations may be used. 

Alternatively, the agents described herein may be administered orally. Oral administration 
of proteins as tiierapeutics generaUy is not practiced as most proteins are readily degraded by 
digestive enzymes and acids in the mammalian digestive system before they can be absorbed into 
30 the bloodstream. However, the morphogens described herein typically are acid stable and 
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protease-resistant (see, for example, U.S. Pat. No. 4,968.590). In addition, at least one 
morphogen, OPl, has been identified in mammaiy gland extract, colostrum and 57-day mUk. 
Moreover, the OPl purified firom mammaiy gland extract is morphogenically active and also is 
detected in the bloodstream. Maternal administration, via ingested milk, may be a natural delivery 
5 route of TGPP superfamily proteins. Letterio et al. (1994), Science 264: 1 936-1938, report that 
TGFp is present in murine milk, and that radiolabeled TGFp is absorbed by gastrointestinal 
mucosa of suckling juveniles. Labeled, ingested TGF3 appears rapidly in intact form in the 
juvenUes' body tissues, inchiding lung, heart and liver. Fmally, soluble form morphogen, e.g., 
mature morphogen associated with the pro domain, is morphogenically active. These findings, as 

10 well as those disclosed in the examples below, indicate that oral and parenteral administration are 
viable means for administering TGFp superfamily proteins, including the morphogens, to an 
individual. In addition, while the mature forms of certain morphogens described herein typically 
are sparingly soluble, the morphogen form found in milk (and mammary gland extract and 
colostrum) is readily soluble, probably by association of the mature, morphogenicaUy active form 

15 witii part or aU of the pro domain of the intact sequence and/or by association with one or more 
milk componMits. Accordingly, the compounds provided herdn also may be associated with 
molecules capable of enhancing their sohibility in vitro or in vivo . 

The compounds provided herein also may be associated with molecules capable of 
targeting tiie morphogen, inducer, agonist or small molecule morphogenic activator to the desired 

20 tissue. For orample, an antibody, antibody fi:agment, or otiier binding protein tiiat interacts 

^ificaUy with a surfece molecule on cells of tiie desired tissue, may be used. Useful targeting 
molecules may be designed, for example, usmg tiie single chain binding site technology disclosed, 
for example, in U.S. Pat. No. 5,091,513. Targeting molecules can be covalently or non- 
covalently associated ivith tiie morphogen, inducer, agonist, or small molecule morphogenic 

25 activator. 

As will be appreciated by one of ordinary skill in the art, the formulated compositions 
contain therapeutically effective amounts of the morphogen, morphogen inducers, agonists of 
morphogen receptors, or small molecule morphogenic activators. That is, they contain amounts 
which provide appropriate concentrations of the agent to the mammalian myogenic precursor cells 
30 for a time suflBcient to stimulate morphogenesis of new and functional myocardhim, and/or to 
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prevent, inhibit or delay further significant loss of myocardium or decline of myocardial function. 
As will be appreciated by those skilled in the art, the concentration of the compounds described in 
a therapeutic composition of the present invention will vary dependmg upon a number of factors, 
including the biological efficacy of the selected agent, the chemical characteristics (e.g., 
5 hydrophobicity) of the compounds employed, the formulation of the compound exdpients, the 
administration route, and the treatment envisioned, including whether the active ingredient will be 
administered directty to cells jn vitro, directly into a tissue site, or systemically. The preferred 
dosage to be administered also is likely to depend on such variables such as the condition of the 
diseased or damaged tissues, and the overall health status of the particular subject. 
10 As a general matter, for systemic administration, daily or weekly dosages of 0.00001 - 

1000 mg of a morphogen are sufficient, with 0.0001-100 mg being preferable, and 0.001 to 10 mg 
being even more preferable. Alternatively, a daily or weekly dosage of 0.0 1 - 1 000 ng/kg body 
weight, more preferably 0. 1-100 \xg/kg body weight, may be advantageously employed. Dosages 
are preferably administered continuously, but daily, multi-weekly, weekly or monthly dosages may 
15 also be employed. In addition, in order to facilitate frequent infusions, implantation of a semi- 
permanent stent (e.g., intravenous, intraperitoneal, intracisternal or intracapsular) may be 
advisable. It should be noted that no obvious morphogen Induced pathological lesions arise when 
mature morphogen (e.g., OPl, 20 mg) is administered daily to normal groAving rats for 
21 consecutive days. Moreover, 10 mg systemic injections of morphogen (e.g., OPl) injected 
20 daily for 1 0 days into normal newborn mice does not produce any gross abnormalities. 

The morphogens, inducws, agonists or small molecule morphogenic activators of the 
invention may, of course, be administered alone or in combination with other molecules known to 
be beneficial in the treatment of the conditions described herein. Thus, in other embodiments the 
present invention provides pharmaceutical compositions in which a morphogen, morphogen 
25 inducer , agonist of a morphogen receptor, or small molecule morphogenic activator is combined 
with other agents which promote or enhance the proliferation and differentiation of myogenic 
precursor cells into new and functional myocardium. Thus, the present invention provides 
pharmaceutical compositions comprising a morphogen, or morphogen inducer, or agonist of a 
morphogen receptor, or small molecule morphogenic activator, in combination witii one or more 
30 of a "muscle extract," conditioned medium from differentiated myotubes grown in culture, bFGF, 



wo 98/27995 



PCT/US97/23611 



-44- 

IGF. PDGF, LIF, ACTH, MSH, or G-CSF. In each such composition, the ratios or the 
morphogenic and mitogenic agents njay be adjusted based upon their activities, as disclosed in the 
literature or as determined through simple experimentation, to provide a therapeutically eflFective 
dosage of each compound in a single unit dosage. The morphogenic and mitogenic agents in such 
a composition each preferably comprise at least about 1%, and more preferably more than 5% or 
10%, of the dry weight of the conaposition. The conq)ositions may, however, include other 
pharmaceutical carriers and active agents, as described above and, generally, in Remington's 
Pharmaceutical Sciences (Gennaro, A., ed.). Mack Pub., 1 990, and, therefore, the morphogenic 
and mitojgenic agents may each comprise a small fraction of the final weight of the pharmaceutical 
composition. 

Practice of the invention, including additional preferred aspects and embodiments thereof, 
will be still more liilly understood from the following examples, which are presented herein for 
ilhistration only and should not be construed as limiting the invention in any way. 

Examples 

Preparati on of Soluble Morphogen Complexes 

A currently preferred form of the morphogen useful herein, having improved sohibility in 
aqueous solutions, is a dimeric morphogenic protein comprising at least the C-terminal seven 
cystdne domain characteristic of the morphogen family, complexed with a peptide compriang a 
pro region of a membo- of the morphogen femily, or a solubility-enhandng fragment thereof or 
an aUeUc, species or other sequence variant thereof Preferably, the dimeric morphogenic protein 
is conq)lexed vwth two pro region peptides. Also, the dimeric morphogenic protein preferably is 
noncovalentty complexed with the pro repon peptides. The pro region peptides preferably 
comprise at least the N-terminal dghteen amino acids that define the pro domain of a given 
naturally occurring morphogen, or an allelic or phylogenetic counterpart variant thereof In other 
preferred embodiments, peptides defining substantially the fiill length pro domain are used. 

Other soluble forms of morphogens include dimers of the uncleaved pro forms of these 
proteins, as well as "hemi-dimers" wherein one subunit of the dimer is an uncleaved pro form of 
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the protein, and the other subunit comprises the mature form of the protein, including truncated 
forms thereof, preferably noncovalently associated with a cleaved pro domain peptide. 

As described above and in published application WO94/03600, the teachings of which are 
incorporated herein by reference, useful pro domains include tiie full lengtii pro regions, as well as 

5 various truncated forms hereof, particularly truncated fortns cleaved at proteolytic Arg-Xaa-^^ 
Arg cleavage sites within tiie pro domain polypeptide. For example, in OPl, ppssible pro 
sequences include sequences defined by residues 30-292 (full lengtii fonn); 48-292; and 1 58-292. 
Sohible OPl complex stability is best enhanced when tiie pro region comprises tiie full length 
fonn ratiier tiian a truncated form, such as tiie residues 48-292 truncated form, in that residues 

10 30-47 show sequence homology to tiie N-tenninal portions of otiier morphogens, and currentiy 
are believed to have particular utility in enhancing complex stability for all morphogens. 
Accordingly, currently preferred pro domains include peptides comprising at least the N-terminal 
fi-agment, e.g.. amino acid residues 30-47 of a naturally occuning morphogen pro domain, or a 
biosyntiietic variant thereof tiiat retains tiie solubiUty and/or stability enhancii^ properties of tiie 

15 naturally-occurring peptide. . 

As will be appreciated by tiiose having ordinary skill in the art, useful sequences encoding 
tiie pro region can be obtained fi-om genetic sequences encoding known morphogens. 
Alternatively, chimeric pro regions can be constructed from tiie sequences of one or more known 
morphogens. Still anotiier option is to create a syntiietic sequence variant of one or more known 

20 pro region sequences. 

In anotiier preferred aspect, useful pro region peptides inchide polypeptide chains 
comprising an amino acid sequence encoded by a nucleic acid tiiat hybridizes under stringent 
conditions witii a DNA or RNA sequence encoding at least tiie N-terminal eighteen amino adds 
of tiie pro region sequence for OPl or 0P2, e.g., nucleotides 136-192 and 152-21 1 of SEQ ID 

25 NOs: IS and 19, respectivdy. 

A. Isolation from conditioned media or body fluid 

Morphogens are expressed from mammalian cells as soluble complexes. Typically, 
however tiie complex is disassociated during purification, generally by exposure to denaturants 
oftra added to tiie purification solutions, such as detergents, alcohols, organic solvents, 
chaotropic agents and compounds added to reduce tiie pH of the solution. Provided below is a 
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currently preferred protocol for purifying the soluble proteins from conditioned media (or, 
optionally, a body fluid such as serum, cerebrospinal or peritoneal fluid), under non-denaturing 
conditions. The method is rapid, reproducible and yields isolated soluble morphogen complexes 
in substantially pure form. 

5 Soluble morphogen complexes can be isolated from conditioned media using a simple, 

three step chromatographic protocol performed in the absence of denaturants. The protocol 
involves running the media (or body fluid) over an afiBnity cohimn, foUowed by ion exchange and 
gel filtration chromatographies. The aBBnity column described below is a Zn-IMAC column. The 
present protocol has general applicability to the purification of a variety of moiphogens, all of 
10 which are anticipated to be isolatable using only minor modifications of the protocol described 
below. An alternative protocol also envisioned to have utility includes an immunoaflBnity column, 
created using standard procedures and, for example, using antibody specific for a given 
morphogen pro domain (complexed, for example, to a protein A-conjugated Sepharose cohimn). 
Protocols for developing immunoaflBnity columns are weU described in the art (see, for example, 
guide to Protein Purification , M. Deutscher, ed.. Academic Press, San Diego, 1 990, particularly 
sections VII and XI thereof). 

In this study, OPl was expressed in mammalian (CHO, Chinese hamster ovary) cells as 
described in the art (see, for example, international application US90/05903 CWO91/05802). The 
CHO cell conditioned media containing 0.5% FBS was initially purified using Immobilized Metal- 
20 Ion Affinity Chromatography (IMAC). The sohible OP 1 complex from conditioned media binds 
very selectively to the Zn-IMAC resin and a high concentration of imidazole (50 mM imidazole, 
pH 8.0) is required for the effective ehition of the bound complex. The Zn-IMAC step separates 
the soluble OPl from the bulk of the contaminating serum proteins that ehite in the flowthrough 
and 35 mM imidazole wash fi^ctions. The Zn-IMAC purified soluble OPl is "next applied to an S- 
Sepharose cation-exchange colunm equilibrated in 20 mM NaP04 (pH 7.0) with 50 mM NaCI. 
This S-Sepharose stq> serves to fiirther purify and concentrate the soluble OPl complex in 
preparation for the following gel fihration step. The protein was applied to a Sephacryl S-200HR 
column equilibrated in TBS. Using substantially the same protocol, soluble morphogens also can 
be isolated from one or more body fluids, including serum, cerebrospinal fluid or peritoneal fluid. 
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IMAC was performed using Chelating-Sepharose (Pharmacia) that had been charged with 
three column volumes of 0.2 M ZnSO*. The conditioned media was titrated to pH 7.0 and 
applied directly to the Zn-IMAC resin equilibrated in 20 mM HEPES .(pH 7.0) with 500 mM 
NaCl. The Zn-IMAC resin was loaded with 80 mL of starting conditioned media per mL of resin. 
5 After loading, the column was washed with equilibration buffer and most of the contaminating 
proteins were ehited with 35 mM imidazole (pH 7.0) in equiUbration buflFer. The soluble OPl 
complex then is eluted with 50 mM imidazole (pH 8.0) in 20 mM HEPES and 500 mM NaCl. 

The 50 mM imidazole eluate containing the soluble OPl complex was diluted with nine 
volumes of 20 mM NaPO* (pH 7.0) and applied to an S-Sepharose (Pharmacia) column 
10 equilibrated in 20 mM NaPO* (pH 7.0) with 50 mM NaCl. The S-Sepharose resin was loaded 
with an equivalent of 800 mL of starting conditioned media per mL of resin. After loading, the S- 
Sepharose cohmin was washed with equilibration buffer and eluted witii 1 00 mM NaCl followed 
by 300 mM and 500 mM NaCl in 20 mM NaP04 (pH 7.0). The 300 mM NaCl pool was fiirther 
purified using gel filtration chromatography. Fifty mis of the 300 mM NaCl eluate was applied to 
a 5.0 X 90 cm Sephacryl S-200HR (Pharmacia) equUibrated in Tris buflFered saline (TBS), 50 mM 
Tris, 150 mM NaCl (pH 7.4). The column was eluted at a flow rate of 5 mL/minute collecting 10 
mL fi-actions. The apparent molecular mass of tiie soluble OPl was determined by comparison to 
protein molecular weight standards (alcohol dehydrogenase (ADH. 150 IcDa), bovine serum 
albumin (BSA, 68 kDa), carbonic anhydrase (CA, 30 kDa) and cytochrome C (cytC, 12.5 kDa). 
20 The purity of the S-200 cohimn fractions was detramined by separation on standard 1 5% 

polyacrylamide SDS gels stained with Coomassie bhie. The identity of tiie mamre OPl and tiie 
pro-domain was determined by N-termmal sequence analysis after separation of the mature OPl 
fi-om the pro-domain using standard reverse phase CI 8 HPLC. 

The soluble OPl complex ehites with an apparent molecular weight of 1 10 kDa. This 
25 agrees weU with tiie predicted compoation of tiie soluble OPl complex with one mature OPl 
dimer (35-36 kDa) associated witii two pro-domains (39 kDa each). Purity of the final complex 
can be verified by running the appropriate fraction in a reduced 1 5% polyacrylamide gel. 

The complex components can be verified by running the complex-containing fraction from 
the S-200 or S-200HR columns over a reverse phase C 1 8 HPLC column and eluting in an 
acetonitrile gradient (in 0. 1% TFA), using standard procedures. The complex is dissociated by 
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this step, and the pro domain and mature species ehite as separate species. These separate species 
then can be subjected to N-terminal sequencing using standard procedures (see, for example. 
Guide to Protein Purification, M. Deutscher, ed.. Academic Press, San Diego, 1990, particularly 
pp. 602-613). and the idenUty of the isolated 36 kDa, 39 kDa proteins confirmed as mature 
5 morphogen and isolated, cleaved pro domain, respectively. N-terminal sequencing of the isolated 
pro domain from mammalian cell produced OPl revealed two forms of the pro region, the intact 
form (beginning at residue 30 of SEQ ID NO: 16) and a truncated fonn, (begimiing at residue 48 
of SEQ ID NO: 16.) N-teiminal sequendng of the polypeptide subunit of the isolated mature 
species reveals a range of N-termini for the mature sequence, beginning at residues 293, 300, 313, 
10 315, 316, and 318, of SEQ ID NO: 16, all of which are active, as demonstrated by the standard 
bone morphogenesis assay set forth in published application W092/15323 as incorporated herein 
by reference. 

B- In Vitro S oluble Morphoeen Complex Formation 

As an alternative to purifying soluble complexes fi-om culture media or a body fluid, 
1 5 soluble complexes can be formulated fi-om purified pro domains and mature dimeric species. 
Successfijl complex formation ^parently requires association of the components under 
denaturing conditions sufficient to relax the folded structure of these molecules, without affecting 
disulfide bonds. Preferably, the denaturing conditions mimic the environmrait of an intracellular 
vesicle sufficiently such that the cleaved pro domain has an opportunity to associate with the 
20 mature dimeric species under relaxed folding conditions. The concentration ofdenaturant in the 
solution then is decreased in a controlled, prefen^ly step-wise manner, so as to aUow proper 
refolding of the dima- and pro regions while maintaining the association of the pro domain with 
the dimer. Usefiil dematurants inchide 4-6M urea or guanidine hydrochloride (GuHCl), in 
buffered solutions of pH 4-10, preferably pH 6-8. The soluble complex then is formed by 
25 controlled dialysis or dilution into a solution having a final denaturant concentration of less than 
0.1-2M urea or GuHCl, preferably 1-2 M urea of GuHCl, which then preferably can be diluted 
into a physiological buffer. Protein purification/renaturing procedures and considerations are well 
described in the art, and details for developing a suitable renaturing protocol readily can be 
determined by one having ordinary skill in the art. One useful text on the subject is Guide to 
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Protein Purification, M. Deutscher, ed.. Academic Press, San Diego, 1990, particularly section V. 
Complex formation also may be aided by addition of one or more chaperone proteins. 
C- Stability of Soluble Morphoeen Complexes 

The stability of the highly purified soluble morphogen complex in a physiological buffer, 
5 e.g., Tris-buflFered saline (TBS) and phosphate-buffered saline (PBS), can be enhanced by any of a 
number of means. The currently preferred method is by means of a pro region that comprises at 
least the first 18 amino acids of the pro sequence (e.g., residues 30-47 of SEQ ID NO: 16 for OP- 
1), and preferably is the fiill length pro region. Residues 30-47 show sequence homology to the 
N-terminal portion of other morphogens and are beUeved to have particular utility in enhancing 
0 complex stabiUty for all morphogens. Other usefiil means for enhancing the stability of soluble 
morphogen complexes include three classes of additives. These additives include basic amino 
adds (e.g., L-arginine, lysine and betaine); nonionic detergents (e.g., Tween 80 or Nonldet P- 
120); and carrier proteins (e.g., serum albumin and casein). Useful concentrations of these 
additives include 1-100 mM, preferably 10-70 mM, including 50 mM, basic amino acid;, 0.01- 
5 1 .0%, preferably 0.05-0.2%, including 0. 1% (v/v) nonionic detergent;, and 0.01-1 .0%, preferably 
0.05-0.2%, includmg 0.1% (w/v) carrier protein. 

Equivalents 

TTie invention may be embodied in other specific forms without dq»arting fi^om the spirit 
or essential characteristics thereof The foregoing embodiments are therefore to be considered in 
0 all respects ilhistrativerathw than limiting on the invention described herein. Scope of the 

invention is thus indicated by the appended claims rather than by the foregoing description, and all 
changes which come within the meaning and range of equivalency of the daims are intended to be 
embraced therdn. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: COHEN, CHARLES M. 

(ii) TITLE OF INVENTION: TREATMENT OF MAMMALIAN MYOCARDIOM WITH 
MORPHOGENICALLY^TRBATBD MYOGENIC PRECURSOR cSSs 

(iii) NUMBER OP SEQUENCES: 31 

(iy) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: TESTA, HURWITZ & THIBEAULT. LLP 

(B) STREET: 125 HIGH STREET 

(C) CITY: BOSTON 

(D) STATE: MA 

(E) COUNTRY: USA 
(P) ZIP: 02110 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: TWOMEY, MICHAEL J 

(B) REGISTRATION NUMBER: 38,34 9 

(C) REFERENCE/DOCKET NUMBER: CRP-123 

(ix) TELECC»iMUNICATION INFORMATION: 

(A) TELEPHONE: 617/248-7000 

(B) TELEFAX: 617/248-7100 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1, .97 

(D) OTHER INFORMATION: /labels Generic-Seq-7 

/note= "wherein each Xaa is independently selected 
from a group of one or niore specified amino acids 
as defined in the specification." 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
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Leu Xaa Xaa Xaa Phe Xaa Xaa Xaa Gly Trp Xaa Xaa Xaa Xaa Xaa Xaa 



10 



15 



Pro xaa Xaa Xaa Xaa Ala Xaa Tyr Cys Xaa Gly Xaa Cys Xaa Xaa Pro 
20 25 



30 



xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn His Ala Xaa Xaa Xaa Xaa Xaa 

40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Cys Xaa Pro 
xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa 



70 



75 



80 



Val Xaa Leu Xaa Xaa Xaa Xaa Xaa Met Xaa Val Xaa Xaa Cys Xaa Cys 



90 



95 



Xaa 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: l.,102 

(D) OTHER INFORMATION: /label= Generic-Seq-8 

/note= "wherin each Xaa is independently selected 
from a group of one or more specified amino acids 
as defined in the specification." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Xaa Xaa Leu Xaa Xaa Xaa Phe Xaa Xaa Xaa Gly Trp Xaa 
5 10 15 

Xaa Xaa Pro Xaa Xaa Xaa Xaa Ala Xaa Tyr Cys Xaa Gly 
20 25 30 

Xaa Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn His Ala 
40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
55 60 

Xaa Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Xaa 
70 75 ao 

Xaa Xaa Val Xaa Leu Xaa Xaa Xaa Xaa Xaa Met Xaa Val 
85 90 95. 

Xaa Cys Xaa 
100 



Cys Xaa Xaa 
1 

Xaa Xaa Xaa 



Xaa Cys Xaa 
35 

Xaa Xaa Xaa 

50 

Xaa Cys Cys 
65 

Xaa Xaa Xaa 



Xaa Xaa Cys 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ixj FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1. .102 

(D) OTHER INFORMATION: /label= OPX 

/note= "WHEREIN EACH XAA IS INDEPENDENTLY SELECTED 
FROM A GROUP OF ONE OR MORE SPECIFIED AMINO ACIDS 
AS DEFINED IN THE SPECIFICATION" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Cys Xaa Xaa His Glu Leu Tyr Val OCaa Phe Xaa Asp Leu Gly Trp Xaa 
Is 10 15 

Asp Trp xaa He Ala Pro Xaa Gly Tyr Xaa Ala Tyr Tyr Cys Glu Gly 
20 25 .30 

Glu Cys Xaa Phe Pro Leu Xaa Ser Xaa Met Asn Ala Thr Asn His Ala 
35 40 45 

He Xaa Gin Xaa Leu Val His Xaa Xaa Xaa Pro Xaa Xaa Val Pro Lys 
50 55 60 

Xaa Cys Cys Ala Pro Thr Xaa Leu Xaa Ala Xaa Ser Val Leu Tyr Xaa 
65 70 75 80 

Asp Xaa Ser Xaa Asn Val Xaa Leu Xaa Lys Xaa Arg Asn Met Val Val 
85 90 95 

Xaa Ala Cys Gly Cys His 
100 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(F) TISSUE TYPE: HIPPOCAMPUS 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..139 
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(D) OTHER INFORMATION: /label= hOPl -MATURE 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Ser Thr Gly Ser Lys Gin Arg Ser Gin Asn Arg Ser Lys Thr Pro Lys 

1 5 10 15 . ^ 

Asn Gin Glu Ala Leu Arg Met Ala Asn Val Ala Glu Asn Ser Ser Ser 
20 25 30 

Asp Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arq 
35 40 45 ^ 

Asp Leu Gly Trp Gin Asp Trp He He Ala Pro Glu Gly Tyr Ala Ala 
50 55 60 ' 

Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met Asn 

70 75 80 

Ala Thr Asn His Ala He Val Gin Thr Leu Val His Phe He Asn Pro 
85 90 95 

Glu Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala He 
100 105 110 

Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val He Leu Lys Lys Tyr 

120 125 

Arg Asn Met Val Val Arg Ala Cys Gly Cys His 
130 135 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: MURIDAE 
(F) TISSUE TYPE: EMBRYO 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..139 

(D) OTHER INFORMATION: /label= MOPl-MATURE 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Ser Thr Gly Gly Lys Gin Arg Ser Gin Asn Arg Ser Lys Thr Pro Lys 
1 5 .10 15 

Asn Gin Glu Ala Leu Arg Met Ala Ser Val Ala Glu Asn Ser Ser Ser 
20 25 30 

Asp Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg 
35 40 45 
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Asp Leu Gly Trp Gin Asp Trp He He Ala Pro Glu Gly Tyr Ala Ala 
50 55 60 

. Tyr TVr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met Asn 

70 75 80 

Ala Thr Asn His Ala lie Val Gin Thr Leu Val His Phe He Asn Pro 
85 90 95 

Asp Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala He 
100 105 110 

Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val He Leu Lys Lys Tvr 
115 120 125 

Arg Asn Met Val Val Arg Ala Cys Gly Cys His 
130 135 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear • 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 
(P) TISSUE TYPE: HIPPOCAMPUS 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..139 

(D) OTHER INFORMATION: /labels H0P2 -MATURE 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ala Val Arg Pro Leu Arg Arg Arg Gin Pro Lys Lys Ser Asn Glu Leu 
^ 5 10 15 

Pro Gin Ala Asn Arg Leu Pro Gly He Phe Asp Asp Val His Gly Ser 
20 25 30 

His Gly Arg Gin Val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Gin 

35 40 45 

Asp Leu Gly Trp Leu Asp Trp Val He Ala Pro Gin Gly Tyr Ser Ala 
50 55 60 

Tyr Tyr Cys Glu Gly Glu Cys Ser Phe Pro Leu Asp Ser Cys Met Asn 
^5 70 75 80 

Ala Thr Asn His Ala He Leu Gin Ser Leu Val His Leu Met Lys Pro 
85 90 95 

Asn Ala Val Pro Lys Ala Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr 
100 105 110 

Ser Val Leu Tyr Tyr Asp Ser Ser Asn Asn Val He Leu Arg Lys His 
115 120 125 
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Arg Asn Met Val Val Lys Ala Cys Gly Cys His 
130 135 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: MURIDAE 
(F) TISSUE TYPE: EMBRYO 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..139 

(D) OTHER INFORMATION: /label- M0P2 -MATURE 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Ala Ala Arg Pro Leu Lys Arg Arg Gin Pro Lys Lys Thr Asn Glu Leu 
1 5 10 15 

Pro His Pro Asn Lys Leu Pro Gly He Phe Asp Asp Gly His Gly Ser 
20 25 30 

Arg Gly Arg Glu Val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Arq 
35 40 45 

Asp Leu Gly Trp Leu Asp Trp Val He Ala Pro Gin Gly Tyr Ser Ala 
50 55 60 

Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asp Ser Cys Met Asn 

70 75 80 

Ala Thr Asn His Ala He Leu Gin Ser Leu Val His Leu Met Lys Pro 
85 90 95 

Asp Val Val Pro Lys Ala Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr 
100 105 110 

Ser Val Leu Tyr Tyr Asp Ser Ser Asn Asn Val He Leu Arg Lys His 
115 120 125 

Arg Asn Met Val Val Lys Ala Cys Gly Cys His 
130 135 

(2). INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: bovinae 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..101 

(D) OTHER INFORMATION: /label= CBMP-2A-FX 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Cys Lys Arg His Pro Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn 
1 5 10 



15 



Asp Trp lie Val Ala Pro Pro Gly Tyr His Ala Phe Tyr Cys His Glv 
20 25 30 

Glu Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Ash His Ala 
35 40 . 45 • 

He Val Gin Thr Leu Val Asn Ser Val Asn Ser Lys He Pro Lys Ala 
50 . 55 60 

Cys Cys Val Pro Thr Glu Leu Ser Ala He Ser Met Leu Tyr Leu Asd 
65 70 75 . 80 

Glu Asn Glu Lys Val Val Leu Lys Asn Tyr Gin Asp Met Val Val Glu 
85 90 95 

Gly Cys Gly Cys Arg 
100 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOIO SAPIENS 
(F) TISSUE TYPE: hippocampus 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..101 

(D) OTHER INFORMATION: /label= CBMP-2B-FX 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn 
^5 10 15 

Asp Trp He Val Ala Pro Pro Gly Tyr Glh Ala Phe Tyr Cvs His Glv 
20 25 30 

Asp Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala 
35 40 45 
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He Val Gin Thr Leu Val Asn Ser Val Asn Ser Ser He Pro Lys Ala 
^° 55 60 

Cys Cys Val Pro Thr Glu Leu Ser Ala He Ser Met Leu Tyr Leu Asp 
.70 75 ^ 80 

Glu Tyr Asp Lys Val Val Leu Lys Asn Tyr Gin Glu Met Val Val Glu 
85 90 95 

Gly Cys Gly Cys Arg 
100 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: DROSOPHILA MELANOGASTER 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..101 

(D) OTHER INFORMATION: /labels DPP-FX 
(xi). SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asp 
^ 5 10 15 

Asp Trp He Val Ala Pro Leu Gly Tyr Asp Ala Tyr Tyr Cys His Gly 
20 25 30 

Lys Cys Pro Phe Pro Leu Ala Asp His Phe Asn Ser Thr Asn His Ala 
35 40 45 

Val Val Gin Thr Leu Val Asn Asn Asn Asn Pro Gly Lys Val Pro Lys 

50 55 60 

Ala Cys Cys Val Pro Thr Gin Leu Asp Ser Val Ala Met Leu Tyr Leu 
' 70 75 80 

Asn Asp Gin Ser Thr Val Val Leu Lys Asn Tyr Gin Glu Met Thr Val 
85 90 95 

Val Gly Cys Gly Cys Arg 
100 

(2) INFORMATION FOR SEQ ID NO:li: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



wo 98/27995 



PCT/US97/23611 



58 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: XENOPUS 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /label= VGL-FX 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Cys Lys Lys Arg His Leu Tyr Val Glu Phe Lys Asp Val Gly Trp Gin 
1 5 xO 



15 



Asn Trp Val He Ala Pro Gin Gly Tyr Met Ala Asn Tyr Cys Tyr Gly 
20 25 30 

Glu Cys Pro Tyr Pro Leu Thr Glu He Leu Asn Gly Ser Asn His Ala 
35 40 45 

He Leu Gin Thr Leu Val His Ser He Glu Pro Glu Asp He Pro Leu 
SO 55 go 

Pro Cys Cys Val Pro Thr Lys Met Ser Pro He Ser Met Leu Phe Tvr 
^5 70 75 80 

Asp Asn Asn Asp Asn Val Val Leu Arg His Tyr Glu Asn Met Ala Val 
85 90 95 

Asp Glu Cys Gly Cys Arg 
100 

(2) INFORMATION FOR SEQ ID N0il2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: MURIDAE 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1 . . 102 

(D) OTHER INFORMATION: /label* VGR-l-FX 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Cys Lys Lys His Glu Leu Tyr Val Ser Phe Gin Asp Val Gly Trp Gin 
1 5 10 15 

Asp Trp .He He Ala Pro Lys Gly Tyr Ala Ala Asn Tyr Cys Asp Gly 
20 25 30 

Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala 
35 40 45 
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He Val Gin Thr Leu Val His Val Met Asn Pro Glu Tyr Val Pro Lys 
50 55 60 

Pro Cys Cys Ala Pro Thr Lys Val Asn Ala He Ser Val Leu Tyr Phe 
" 70 75 80 

Asp Asp Asn Ser Asn Val He Leu Lys Lys Tyr Arg Asn Met Val Val 
85 90 95 

Arg Ala Cys Gly Cys His 
100 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(P) TISSUE TYPE: brain 

( ix) FEATURE : 

(A) NAME/KEY: Protein 

(B) LOCATION: 1«.106 

(D) OTHER INFORMATION: /note- "GDF-1 (fx)" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Cys Arg Ala Arg Arg Leu Tyr Val Ser Phe Arg Glu Val Gly Trp His 
1 5 10 15 

Arg Trp Val He Ala Pro Arg Gly Phe Leu Ala Asn Tyr Cys Gin Gly 
20 25 30 

Gin Cys Ala Leu Pro Val Ala Leu Ser Gly Ser Gly Gly Pro Pro Ala 
35 40 45 

Leu Asn His Ala Val Leu Arg Ala Leu Met His Ala Ala Ala Pro Gly 
50 55 60 

Ala Ala Asp Leu Pro Cys Cys Val Pro Ala Arg Leu Ser Pro He Ser 
^5 70 75 80 

Val Leu Phe Phe Asp Asn Ser Asp Asn Val Val Leu Arg Gin Tyr Glu 
85 90 95 

Asp Met Val Val Asp Glu Cys Gly Cys Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



Ui) SEQUENCE DESCRIPTION: SBQ ID NO: 14: 

Cys Xaa Xaa Xaa Xaa 
1 5 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1B22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 
(F) TISSUE TYPE: HIPPOCAMPUS 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 49.. 1341 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /function= "OSTEOGENIC PROTEIN" 

/product = "OPl" 
/evidence- EXPERIMENTAL 
/standard_name« "OPl" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GGTGCGGGCC CGGAGCCCGG AGCCCGGGTA GCGCGTAGAG CCGGCGCG ATG CAC GTG 57 

Met His Val 
1 

CGC TCA CTG CGA GCT GCG GCG CCG CAC AGC TTC GTG GCG CTC TGG GCA 105 
Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala Leu Trp Ala 
5 10 15 

CCC CTG TTC CTG CTG CGC TCC GCC CTG GCC GAC TTC AGC CTG GAC AAC 153 
Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser Leu Asp Asn 
20 25 30 ^ 35 

GAG GTG CAC TCG AGC TTC ATC CAC CGG CGC CTC CGC AGC CAG GAG CGG 201 
Glu Val His Ser Ser Phe He His Arg Arg Leu Arg Ser Gin Glu Arq 
40 45 50 

CGG GAG ATG CAG CGC GAG ATC CTC TCC ATT TTG GGC TTG CCC CAC CGC 24 9 

Arg Glu Met Gin Arg Glu He Leu Ser He Leu Gly Leu Pro His Arg 
55 60 65 
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CCG CGC CCG CAC CTC CAG GGC AAG CAC AAC TCG GCA CCC ATG TTC ATG 297 
Pro Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro Met Phe Met 
70 75 80 

CTG GAC CTG TAC AAC GCC ATG GCG GTG GAG GAG GGC GGC GGG CCC GGC 345 
Leu Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Gly Gly Gly Pro Glv 
85 90 95 

GGC CAG GGC TTC TCC TAC CCC TAC AAG GCC GTC TTC AGT ACC CAG GGC 393 
Gly Gin Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser Thr Gin Glv 
100 105 110 

CCC CCT CTG GCC AGC CTG CAA GAT AGC CAT TTC CTC ACC GAC GCC GAC 441 
Pro Pro Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr Asp Ala Asp 
120 125 130 

ATG GTC ATG AGC TTC GTC AAC CTC GTG GAA CAT GAC AAG 6AA TTC TTC 489 
Met Val Met Ser Phe Val Asn Leu Val Glu His Asp Lys Glu Phe Phe 
135 140 145 

CAC CCA CGC TAC CAC CAT CGA GAG TTC CGG TTT GAT CTT TCC AAG ATC 537 
His Pro Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu Ser Lys He 
150 155 160 

CCA GAA GGG GAA OCT GTC ACG GCA GCC GAA TTC CGG ATC TAC AAG GAC 585 
Pro Glu Gly Glu Ala Val Thr Ala Ala Glu Phe Arg He Tyr Lys Asp 
165 170 175 

TAC ATC CGG GAA CGC TTC GAC. AAT GAG ACG TTC CGG ATC AGC GTT TAT 633 
Tyr He Arg Glu Arg Phe Asp Asn Glu Thr Phe Arg lie Ser Val Tyr 
180 185 190 195 

CAG GTG CTC CAG GAG CAC TTG GGC AGG GAA TCG GAT CTC TTC CTG CTC 681 
Gin Val Leu Gin Glu His Leu Gly Arg Glu Ser Asp Leu Phe Leu Leu 
200 205 210 

GAC AGC CGT ACC CTC TGG GCC TCG GAG GAG GGC TGG CTG GTG TTT GAC 729 
Asp Ser Arg Thr Leu Trp Ala Ser Glu Glu Gly Trp Leu Val Phe Asp 
215 220 225 

ATC ACA GCC ACC AGC AAC CAC TGG GTG GTC AAT CCG CGG CAC AAC CTG 777 
He Thr Ala Thr Ser Asn His Trp Val Val Asn Pro Arg His Asn Leu 
230 235 240 

GGC CTG CAG CTC TCG GTG GAG ACG CTG GAT GGG CAG AGC ATC AAC CCC 825 
Gly Leu Gin Leu Ser Val Glu Thr Leu Asp Gly Gin Ser He Asn Pro 
245 250 255 

AAG TTG GCG GGC CTG ATT GGG CGG CAC GGG CCC CAG AAC AAG CAG CCC 873 
Lys Leu Ala Gly Leu He Gly Arg His Gly Pro Gin Asn Lys Gin Pro 
260 265 270 275 

TTC ATG GTG GCT TTC TTC AAG GCC ACG GAG GTC CAC TTC CGC AGC ATC 921 
Phe Met Val Ala Phe Phe Lys Ala Thr Glu Val His Phe Arg Ser He 
280 285 290 

CGG TCC ACG GGG AGC AAA CAG CGC AGC CAG AAC CGC TCC AAG ACG CCC 969 
Arg Ser Thr Gly Ser Lys Gin Arg Ser Gin Asn Arg Ser Lys Thr Pro 
295 300 305 

AAG AAC CAG GAA GCC CTG CGG ATG GCC AAC GTG GCA GAG AAC AGC AGC 1017 
Lys Asn Gin Glu Ala Leu Arg Met Ala Asn Val Ala Glu Asn Ser Ser 
310 315 320 
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1209 



62 



AGC GAC CAG AGG CAG GCC TGT AAG AAG CAC GAG CTG TAT GTC AGC TTC 1065 
Ser Asp Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr Val Ser Phe 
325 330 335 

CGA GAC CTG GGC TGG CAG GAC TGG ATC ATC GCG CCT GAA GGC TAC GCC 1113 
Arg Asp Leu Gly Trp Gin Asp Trp lie He Ala Pro Glu Gly Tyr Ala 

345 350 355 

GCC TAC TAC TGT GAG GGG GAG TGT GCC TTC CCT CTG AAC TCC TAC ATG 
Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser Tvr Met 
360 365 370 

AAC GCC ACC AAC CAC GCC ATC GTG CAG ACG CTG GTC CAC TTC ATC AAC 
Asn Ala Thr Asn His Ala He Val Gin Thr Leu Val His Phe He Asn 
375 380 385 

CCG GAA ACG GTG CCC AAG CCC TGC TGT GCG CCC ACG CAG CTC AAT GCC 1257 
Pro Glu Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala 
390 395 

ATC TCC GTC CTC TAC TTC GAT GAC AGC TCC AAC GTC ATC CTG AAG AAA 1305 
He Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val He Leu Lys Lys 
405 410 

TAC AGA AAC ATG GTG GTC CGG GCC TGT GGC TGC CAC TAGCTCCTCC 
Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His 
420 425 430 

GAGAATTCAG ACCCTTTGGG GCCAAGTTTT TCTGGATCCT CCATTGCTCG CCTTGGCCAG 
6AACCAGCAG ACCAACTGCC TTTTGTGAGA CCTTCCCCTC CCTATCCCCA ACTTTAAAGG 
TGTGAGAGTA TTAGGAAACA TGAGCAGCAT ATGGCTTTTG ATCAGTTTTT CAGTGGCAGC 
ATCCAATGAA CAAGATCCTA CAAGCTGTGC AGGCAAAACC TAGCAGGAAA AAAAAACAAC 
GCATAAAGAA AAATGGCCGG GCCAGGTCAT TGGCTGGGAA GTCTCAGCCA TGCACGGACT 1651 
CGTTTCCAGA GGTAATTATG AGCGCCTACC AGCCAGGCCA CCCAGCCGTG GGAGGAAGGG 1711 
GGCGTGGCAA GGGGTGGGCA CATTGGTGTC TGTGCGAAAG GAAAATTGAC CCGGAAGTTC 1771 
CTGTAATAAA TGTCACAATA AAACGAATGA ATGAAAAAAA AAAAAAAAAA A 1822 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met His Val Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala 
^5 10 15 

Leu Trp Ala Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser 
20 25 30 



1351 

1411 
1471 
1531 
1591 
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Leu Asp Asn Glu Val His Ser Ser Phe lie His Arg Arg Leu Arg Ser 

35 ^ ft — • 



40 



45 



Gin Glu Arg Arg Glu Met Gin Arg Glu lie Leu Ser He Leu Gly Leu 



55 



60 



Pro His Arg Pro Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro 

80 



.65 70 75 



Met Phe Met Leu Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Gly Gly 



90 



95 



Gly Pro Gly Gly Gin Gly Phe Ser Tyr Pro lyr Lys Ala Val Phe Ser 
XUU . 105 



110 

Thr Gin Gly Pro Pro Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr 

115 120 

Asp Ala ASP Met Val Met Ser Phe Val Asn Leu Val Glu His Asp Lys 

Glu Phe Phe His Pro Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu 

"0 155 160 

Ser Lys He Pro Glu Gly Glu Ala Val Thr Ala Ala Glu Phe Arg He 

165 " "~" 



170 



175 



Tyr Lys Asp Tyr He Arg Glu Arg Phe Asp Asn Glu Thr Phe Arq He 
180 185 



190 



Ser Val Tyr Gin Val Leu Gin Glu His Leu Gly Arg Glu Ser Asp Leu 
195 200 205 

Phe Leu Leu Asp Ser . Arg Thr Leu Trp Ala Ser Glu Glu Gly Trp Leu 



215 



220 



Val Phe Asp He Thr Ala Thr Ser Asn His Trp Val Val Asn Pro Arq 

ZZ^ OOA 



230 



235 



240 



His Asn Leu Gly Leu Gin Leu Ser . Val Glu Thr Leu Asp Gly Gin Ser 
245 250 255 

He Asn Pro Lys Leu Ala Gly Leu He Gly Arg His Gly Pro Gin Asn 
260 265 270 

Lys Gin .Pro Phe Met Val Ala Phe Phe Lys Ala Thr Glu Val His Phe 
275 280 285 

Arg Ser He Arg Ser Thr Gly Ser Lys Gin Arg Ser Gin Asn Arg Ser 
290 295 300 

Lys Thr Pro Lys Asn Gin Glu Ala Leu Arg Met Ala Asn Val Ala Glu 



310 



315 



320 



Asn Ser Ser Ser Asp Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr 
325 330 

Val Ser Phe Arg Asp Leu Gly Trp Gin Asp Trp He He Ala Pro Glu 
340 345 350 

Gly Tyr Ala Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn 
355 360 365 
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Ser Tyr Met Asn Ala Thr Asn His Ala lie Val Gin Thr Leu Val His 
370 375 

Phe lle Asn Pro Glu Thr Val Pro Lyg Pro Cys Cys Ala Pro Thr Gin 

390 395 400 

Leu Asn Ala He Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val He 
405 410 415 

Leu Lys Lys Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His 
420 425 430 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1873 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: MURIDAE 
(P) TISSUE TYPE: EMBRYO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 104.. 1393 

(D) OTHER INFORMATION: /function= "OSTEOGENIC PROTEIN" 
/product^ "MOPl" 
/note= "MOPl (CDNA) » 

Ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CTGCAGCAAG TGACCTCGGG TCGTGGACCG CTGCCCTGCC CCCTCCGCTG CCACCTGGGG 60 

CGGCGCGGGC CCGGTGCCCC GGATCGCGC6 TAGAGCCGGC 6CG ATG CAC GTG CGC 115 

Met His Val Arg 
1 

TCG CTG CGC GCT GCG GCG CCA CAC AGC TTC GTG GCG CTC TGG GCG CCT 163 
Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala Leu Trp Ala Pro 
5 10 15 20 

CTG TTC TTG CTG CGC TCC GCC CTG GCC GAT TTC AGC CTG GAC AAC GAG 211 
Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser Leu Asp Asn Glu 
25 30 35 

GTG CAC TCC AGC TTC ATC CAC CGG CGC CTC CGC AGC CAG GAG CGG CGG 259 
Val His Ser Ser Phe He His Arg Arg Leu Arg Ser Gin Glu Arg Arg 
40 45 50 

GAG ATG CAG CGG GAG ATC CTG TCC ATC TTA GGG TTG CCC CAT CGC CCG 307 
Glu Met Gin Arg Glu He Leu Ser He Leu Gly Leu Pro His Arg Pro 
55 60 65 



CGC CCG CAC CTC CAG GGA AAG CAT AAT TCG GCG CCC ATG TTC ATG TTG 



355 
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. 451 



499 



547 



595 



643 



691 



•65 



Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro Met Phe Met Leu 
'0 75 80 

GAC CTG TAC AAC GCC ATG GCG GTG GAG GAG AGO GGG CGG GAC GGA CAG 
Asp Leu T/r Asn Ala Met Ala Val Glu Glu Ser Gly Pro Asp Gly Gin 
®5 90 .95 ' 100 

GGC TTC TCC TAC CCC TAC AAG GCC GTC TTC AGT ACC CAG GGC CCC CCT 
Gly Phe Ser Tyr Pro Tyr Lya Ala Val Phe Ser Thr Gin Gly Pro Pro 
105 110 

TTA GCC AGC CTG CAG GAC AGC CAT TTC CTC ACT GAC GCC GAC ATC GTC 
Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr Asp Ala Asp Met Val 
"0 125 130 

ATG AGC TTC GTC AAC CTA GTG GAA CAT GAC AAA GAA TTC TTC CAC CCT 
Met Ser Phe Val Asn Leu Val Glu His Asp Lys Glu Phe Phe His Pro 
135 140 145 

CGA TAC CAC CAT CGG GAG TTC CGG TTT GAT CTT TCC AAG ATC CCC GAG 
Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu Ser Lys He Pro Glu 
150 . 155 , 160 

GGC GAA CGG GTG ACC GCA GCC GAA TTC AGO ATC TAT AAG GAC TAC ATC 
Gly Glu Arg Val Thr Ala Ala Glu Phe Arg He Tyr Lys Asp Tyr He 

"0 175 180 

CGG GAG CGA TTT GAC AAC GAG ACC TTC CAG ATC ACA GTC TAT CAG GTG 
Arg Glu Arg Phe Asp Asn Glu Thr Phe Gin He Thr Val Tyr Gin Val 
185 190 195 

CTC CAG GAG CAC TCA GGC AGG GAG TCG GAC CTC TTC TTG CTC GAC AGC 739 
Leu Gin Glu His Ser Gly Arg Glu Ser Asp Leu Phe Leu Leu Asp Ser 
200 205 210 

TGG TTC GTG TTT GAT ATC ACA 787 
Arg Thr He Trp Ala Ser Glu Glu Gly Trp Leu Val Phe Asp He Thr 
215 220 225 

^ ^ °TC AAC CCT CGG CAC AAC CTC GGC TTA 835 

Ala Thr Ser Asn His Trp Val Val Asn Pro Arg His Asn Leu Gly Leu 
230 235 240 

CAG CTC TCT GTG GAG ACC CTG GAT GGG CAG AGC ATC AAC CCC AAG TTG 
Gin Leu Ser Val Glu Thr Leu Asp Gly Gin Ser He Asn Pro Lys Leu 

250 255 260 

GCA GGC CTG ATT GGA CGG CAT GGA CCC CAG AAC AAG CAA CCC TTC ATC 
Ala Gly Leu He Gly Arg His Gly Pro Gin Asn Lys Gin Pro Phe Met 
265 270 275 

GTG GCC TTC TTC AAG GCC ACG GAA GTC CAT CTC CGT AGT ATC CGG TCC 979 
Val Ala Phe Phe Lys Ala Thr Glu Val His Leu Arg Ser He Arq Ser 
280 285 290 

ACG GGG GGC AAG CAG CGC AGC CAG AAT CGC TCC AAG ACG CCA AAG AAC 1027 
Thr Gly Gly Lys Gin Arg Ser Gin Asn Arg Ser Lys Thr Pro Lys Asn 
295 300 305 

CAA GAG GCC CTG AGG ATC GCC AGT GTC GCA GAA AAC AGC AGC AGT GAC 1075 
Gin Glu Ala Leu Arg Met Ala Ser Val Ala Glu Asn Ser Ser Ser Asp 
310 315 320 



883 



931 
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CAG AGG GAG GCC TGC AAG AAA CAT GAG CTG TAG GTC AGO TTC CGA GAG 
Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asd 
325 330 335 340 

CTT GGC TGG CAG GAG TGG ATC ATT GCA OCT GAA GGC TAT GCT GCC TAG 
I^u Gly Trp Gin Asp Trp He He Ala Pro Glu Gly Tyr Ala Ala Tyr 
345 350 355 

TAG T6T GAG GGA GAG TGC GCC TTC CCT CTG AAC TCC TAG ATG AAC GCC 
Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met Asn Ala 
360 365 370 

ACC AAC CAC GCC ATC GTC CAG ACA CTG GTT CAC TTC ATC AAC CCA GAC 
Thr Asn His Ala He Val Gin Thr Leu Val His Phe He Asn Pro Asp 
375 380 385 

ACA GTA GCC AAG CCC TGC TGT GCG GCC ACC CAG GTC AAC GCC ATC TCT 1315 
Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala He Ser 
390 395 400 

GTC CTG TAG TTC GAC GAC AGC TCT AAT GTC ATC CTG AAG AAG TAG AGA 1363 
Val Leu Tyr Phe Asp Asp Ser Ser Asn Val He Leu Lys Lys Tyr Ara 

410 415 420 

AAC ATG GTG GTC CGG GCC TGT GGC TGC CAC TAGCTCTTCC TGAGACCCTG 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 4 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met His Val Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala 
15 10 15 

Leu Trp Ala Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser 
20 25 30 

Leu Asp Asn Glu Val His Ser Ser Phe He His Arg Arg Leu Arg Ser 



1123 



1171 



1219 



1267 



1413 





425 




430 








ACCTTTGCGG 


GGCCACACCT 


TTCCAAATCT 


TCGATGTCTC 


ACCATCTAAG 


TCTCTCACTG 


1473 


CCCACCTTGG 


CGAGGAGAAC 


AGACCAACCT 


CTCCTGAGCC 


TTCCCTCACC 


TCCCAACCGG 


1533 


AAGCATGTAA 


6GGTTCCA6A 


AACCTGAGCG 


TGCAGCAGCT 


GATGAGCGCC 


CTTTCCTTCT 


1593 


GGCACGTGAC 


GGACAAGATC 


CTACCAGCTA 


CCACAGCAAA 


CGCCTAAGAG 


CAGGAAAAAT 


1653 


GTCTGCCAGG 


AAAGTGTCCA 


GTGTCCACAT 


G6CCCCTGGC 


GCTCTGAGTC 


TTTGAGGAGT 


1713 


AATCGCAAGC 


CTCGTTCAGC 


TGCAGCAGAA 


GGAAGGGCTT 


AGCCAGGGTG 


GGCGCTGGCG 


1773 


TCTGTGTTiGA 


AGGGAAACCA 


AGCAGAAGCC 


ACTGTAATGA 


TATGTCACAA 


TAAAACCCAT 


1833 


GAAT6AAAAA 


AAAAAATU^ 


AAAAAAAAAA 


AAAAGAATTC 






1873 
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35 



40 



45 



Gin Glu Arg Arg Glu Met Gin Arg Glu lie Leu Ser lie Leu Gly Leu 
50 55 60 

Pro His Arg Pro Arg Pro His Leu Gin Gly Lys His Asn Ser Ala - Pro 
^5 70 75 80 

Met Phe Met Leu Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Ser Gly 
85 90 95 ^ 

Pro Asp Gly Gin Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser Thr 

100 105 110 

Gin Gly Pro Pro Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr Asp 
115 120 125 

Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu His Asp Lys Glu 
130 135 140 

Phe Phe His Pro Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu Ser 
145 150 155 160 

Lys He Pro Glu Gly Glu Arg Val Thr Ala Ala Glu Phe Arg He Tyr 
165 170 175 

Lys Asp Tyr He Arg Glu Arg Phe Asp Asn Glu Thr Phe Gin He Thr 
180 185 190 

Val Tyr Gin Val Leu Gin Glu His Ser Gly Arg Glu Ser Asp Leu Phe 
195 200 205 

Leu Leu Asp Ser Arg Thr He Trp Ala Ser Glu Glu Gly Trp Leu Val 
210 215 220 

Phe Asp He Thr Ala Thr Ser Asn His Trp Val Val Asn Pro Arq His 
225 230 235 240 

Asn Leu Gly Leu Gin Leu Ser Val Glu Thr Leu Asp Gly Gin Ser He 

245 250 255 

Asn Pro Lys Leu Ala Gly Leu He Gly Arg His Gly Pro Gin Asn Lys 
260 265 270 

Gin Pro Phe Met Val Ala Phe Phe Lys Ala Thr Glu Val His Leu Arg 

-275 280 285 

Ser He Arg Ser Thr Gly Gly Lys Gin Arg Ser Gin Asn Arg Ser Lys 
290 295 300 

Thr Pro Lys Asn Gin Glu Ala Leu Arg Met Ala Ser Val Ala Glu Asn 
305 310 315 320 

Ser Ser Ser Asp Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr Val 
325 330 335 

Ser Phe Arg Asp Leu Gly Trp Gin Asp Trp He He Ala Pro Glu Gly 
340 345 . 350 

Tyr Ala Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser 
355 360 365 

Tyr Met Asn Ala Thr Asn His Ala He Val Gin Thr Leu Val His Phe 
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370 375 



380 



lie Asn Pro Asp Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin Leu 

390 395 400 

Asn Ala He Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val He Leu 
405 410 415 

Lys Lys Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His 
420 425 430 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1723 base pairs 
. (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(F) TISSUE TYPE: HIPPOCAMPUS 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 4 90.. 1696 

(D) OTHER INFORMATION: /function^ "OSTEOGENIC PROTEIN" 
/products "hOP2-PP" 
/note== "hOP2 (cDNA) " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

G6CGCCGGCA GAGCAGGAGT GGCTGGAGGA GCTGTGGTrG GAGCAGGAGG TGGCACGGCA 60 

GGGCTGGAGG GCTCCCTATG AGTGGCGGAG ACGGCCCAGG AGGCGCTCGA GCAACAGCTC 120 

CCACACCGCA CCAAGCGGTG GCTGCAGGAG CTCGCCCATC GCCCCTGCGC TGCTCGGACC 180 

GCGGCCACAG CCGGACTGGC GGGTACGGCG GCGACAGAGG CATTGGCCGA GAGTCCCAGT 240 

CCGCAGAGTA GCCCCGGCCT CGAGGCGGTG GCGTCCCGGT CCTCTCCGTC CAGGAGCCAG 300 

GACAGGTGTC GCGCGGCGGG GCTCCAGGGA CCGCGCCTGA GGCCGGCTGC CCGCCCGTCC 360 

CGCCCCGCCC CGCCGCCCGC CGCCCGCCGA GCCCAGCCTC CTTGCCGTCG GGGCGTCCCC 420 

AGGCCCTGGG TCGGCCGCGG AGCCGATGCG CGCCCGCTGA GCGCCCCAGC TGAGCGCCCC 480 

CGGCCTGCC ATG ACC GCG CTC CCC GGC CCG CTC TGG CTC CTG GGC CTG 528 
Met Thr Ala Leu Pro Gly Pro Leu Trp Leu Leu Gly Leu 
1 5 10 

GCG CTA TGC GCG CTG GGC GGG GGC GGC CCC GGC CTG CGA CCC CCG CCC 576 
Ala Leu Cys Ala Leu Gly Gly Gly Gly Pro Gly Leu Arg Pro Pro Pro 
15 20 25 

GGC TGT CCC CAG CGA CGT CTG GGC GCG CGC GAG CGC CGG GAC GTG CAG 624 
Gly Cys Pro Gin Arg Arg Leu Gly Ala Arg Glu Arg Arg Asp Val Gin 
30 35 40 45 
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CGC GAG ATC CTG GCG GT6 CTC GGG CTG CCT GGG CGG CCC CGG CCC err 
Arg Glu He Leu Ala Val Leu Gly Leu Pro ^ Sg Pro ^ Pro S 
50 55 60 

GCG CCA CCC GCC GCC TCC CGG CTG CCC GCG TCC GCG CCG CTC TTC ATG 
Ala Pro Pro Ala Ala Ser Arg Leu Pro Ala Ser Ala Pro Leu Phe Met 
o5 70 75 

CTG GAC CTG TAC CAC GCC ATG GCC GGC GAC GAC GAC GAG GAC GGC crr 
Leu ASP Leu Tyr His Ala Met Ala Gly Asp Asp Sp ^ 2p Sy Sa 

85 90 

P^^ »^ ^ CTC GTC ATC AGC TTC GTT 

Pro Ala Glu Arg Arg Leu Gly Arg Ala Asp Leu Val Met Ser Phe Val 
3= 100 105 

AAC ATG GTG GAG CGA GAC CGT GCC CTC GGC CAC CAG GAG CCC CAT TOT 
Asn Met val Glu Arg Asp Arg Ala Leu Gly His Sn So Ss S 

115 120 125 

^ r^?. ^ ^ ^ ATC CCG GCT GGG GAG GCG GTC 

Lys Glu Phe Arg Phe Asp Leu Thr Gin He Pro Ala Gly Glu Ala Val 
130 135 

^ ^ m ° ™^ ^ ^ CCC AGC ATC CAC CTG CTC 

Thr Ala Ala Glu Phe Arg He Tyr Lys Val Pro Ser He His Leu Leu 

150 155 

AAC AGG ACC CTC CAC GTC AGC ATC TTC CAG GTC GTC CAG GAG CAG TCC 
Asn Arg Thr Leu His Val Ser Met Phe Gin Val Val Gin Glu Gin Ser 
. 1^0 165 

AAC AGG GA6 TCT GAC TTC TTC TTT TTC GAT CTT CAG ACG CTC CGA GCT 
Asn Arg Glu Ser Asp Leu Phe Phe Leu Asp Leu Gin Thr Leu Arg Ala 

175 XBO 



185 



tt^ n,^ ^ ^'^^ ^^"^ ^"^^ GCA GCC ACT GAC TCC 

Gly Asp Glu Gly Trp Leu Val Leu Asp Val Thr Ala Ala Ser Asp Cys 

195 200 205 

TGG TTG CTG AAG CGT CAC AAG GAC CTG GGA CTC CGC CTC TAT GTG GAG 
Trp Leu Leu Lys Arg His Lys Asp Leu Gly Leu Arg Leu Tyr Val Glu 
210 215 220 

^ ^'^^ CTG GCC GGC CTG CTG GGT 

Thr Glu Asp Gly His Ser Val Asp Pro Gly Leu Ala Gly Leu Leu Gly 
225 230 235 

CAA CGG GCC CCA CGC TCC CAA CAG CCT TTC GTG GTC ACT TTC TTC AGG 
Gin Arg Ala Pro Arg Ser Gin Gin Pro Phe Val Val Thr Phe Phe Arg 
240 245 250 

GCC AGT CCG AGT CCC ATC CGC ACC CCT CGG GCA GTG AGG CCA CTC AGG 
Ala Ser Pro Ser Pro lie Arg Thr Pro Arg Ala Val Arg Pro Leu Arg 
255 260 265 

AGG AGG CAG CCG AAG AAA AGC AAC GAG CTG CCG CAG GCC AAC CGA CTC 
Arg Arg Gin Pro Lys Lys Ser Asn Glu Leu Pro Gin Ala Asn Arg Leu 

275 280 285 

CCA GGG ATC TTT GAT GAC GTC CAC GGC TCC CAC GGC CGG CAG GTC TGC 
Pro Gly He Phe Asp Asp Val His Gly Ser His Gly Arg Gin Val Cvs 
290 295 . 300 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1246 



1296 



1344 



1392 
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Ir^ ^ 7^ ^"^^ '^^^ ^« CTC GGC TGG CTC GAC 1440 

Arg Arg His Glu Leu lyr Val Ser Phe Gin Asp Leu Gly Trp Leu S 
305 310 

TGG GTC ATC GCT CCC CAA GGC TAG TCG GCC TAT TAG TGT GAG GGG GAr i^hq 
Trp val lie Ala Pro Gin Gly Tyr Ser Ala Tyr gr G^u G^ Su 
320 325 

TGC TCC TTC CCA CTG GAC TCC TGC ATG AAT GCC ACC AAC CAC GCC ATC 1536 
Cys ser Phe Pro Leu Asp Ser Cys Met Asn Ala Thr Asn His Ala ne 
335 340 

CTG CAG TCC CTG GTG CAC CTG ATG AAG CCA AAC GCA GTC CCC aar rcr icq^ 
Leu Gin ser Leu Val His Leu Met Lys Pro AsS ^ SIJ Pro JJs Sa 

355 360 365 

TGC TGT GCA CCC ACC AAG CTG AGC GCC ACC TCT GTG CTC TAC TAT t5Ar isio 
Cys cys Ala Pro Thr Lys Leu Ser Ala Thr Ser Si Leu gr ^ 
370 375 380 

AGC AGC AAC AAC GTC ATC CTG CGC AAA CAC CGC AAC ATG GTC GTC AAG 1680 
Ser Ser Asn Asn Val He Leu Arg Lys His Arg Asn mS Val Val ^ '° 
385 390 395 

GCC TGC GGC TGC CAC T GAGTCAGCCC GCCCAGCCCT ACTGCAG 1703 
Ala Cys Gly Cys His '"^-^ 
400 

(2) INFORMATION FOR SBQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Met Thr Ala Leu Pro Gly Pro Leu Trp Leu Leu Gly Leu Ala Leu Cys 
1 5 10 15 

Ala Leu Gly Gly Gly Gly Pro Gly Leu Arg Pro Pro Pro Gly Cys Pro 
20 25 30 

Gin Arg Arg Leu Gly Ala Arg Glu Arg Arg Asp Val Gin Arg Glu He 
35 40 45 

Leu Ala Val Leu Gly Leu Pro Gly Arg Pro Arg Pro Arg Ala Pro Pro 
= 0 55 60 

Ala Ala Ser Arg Leu Pro Ala Ser Ala Pro Leu Phe Met Leu Asp Leu 

70 75 80 

Tyr His Ala Met Ala Gly Asp Asp Asp Glu Asp Gly Ala Pro Ala Glu 
85 90 35 

Arg Arg Leu Gly Arg Ala Asp Leu Val Met Ser Phe Val Asn Met Val 
100 105 110 

Glu Arg Asp Arg Ala Leu Gly His Gin Glu Pro His Trp Lys Glu Phe 
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115 120 



125 



Arg Phe Asp Leu Thr Gin He Pro Ala Gly Glu Ala Val Thr Ala Ala 

135 140 

Glu Phe Arg He Tyr Lys Val Pro Ser He His Leu Leu Asn Arg Thr 

150 155 160 

Leu His val Ser Met Phe Gin Val Val Gin Glu Gin Ser Asn Arg Glu 
1*5 170 175 

ser Asp Leu Phe Phe Leu Asp Leu Gin Thr Leu Arg Ala Gly Asp Glu 
180 185 190 

Gly Trp Leu Val Leu Asp Val Thr Ala Ala Ser Asp Cys Trp Leu Leu 

200 205 

Lys Arg His Lys Asp Leu Gly Leu Arg Leu Tyr Val Glu Thr Glu Asp 

■^10 215 220 

Gly His Ser Val Asp Pro Gly Leu Ala Gly Leu Leu Gly Gin Arg Ala 
'^^ 230 235 240 

Pro Arg Ser Gin Gin Pro Phe Val Val Thr Phe Phe Arg Ala Ser Pro 
245 250 255 

Ser Pro He Arg Thr Pro Arg Ala Val Arg Pro Leu Arg Arg Arg Gin 
260 265 270 

Pro Lys Lys Ser Asn Glu Leu Pro Gin Ala Asn Arg Leu Pro Gly He 
275 280 285 

Phe Asp Asp Val His Gly Ser His Gly Arg Gin Val Cys Arg Arg His 

295 300 

Glu Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp Leu Asp Trp Val He 

310 315 320 

Ala Pro Gin Gly Tyr Ser Ala Tyr Tyr Cys Glu Gly Glu Cys Ser Phe 
325 330 

Pro Leu Asp Ser Cys Met Asn Ala Thr Asn His Ala He Leu Gin Ser 
340 345 350 

Leu Val His Leu Met Lys Pro Asn Ala Val Pro Lys Ala Cys Cys Ala 

355 360 365 

Pro Thr Lys Leu Ser Ala Thr Ser Val Leu Tyr Tyr Asp Ser Ser Asn 
370 375 

Asn Val He Leu Arg Lys His Arg Asn Met Val Val Lys Ala Cys Gly 

390 395 400 

Cys His 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1926 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: MURIDAE 
(F) TISSUE TYPE: EMBRYO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 93.. 1289 

(D) OTHER INFORMATION: /function^ "OSTEOGENIC PROTEIN" 
/product= "mOP2-PP'» 
/note= "inOP2 cDNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:21: 
GCCAGGCACA GGTGCGCCGT CTGGTCCTCC CCGTCTCGCG TCAGCCGAGC CCGACCAGCT 60 

ACCAGTGGAt GCGC6CCGGC TGAAAGTCCG AG ATG GCT ATG CGT CCC GGG CCA 113 

Met Ala Met Arg Pro Gly Pro 
1 5 

CTC TGG CTA TTG GGC CTT GCT CTG TGC GCG CTG GGA GGC GGC CAC GGT 161 
Leu Trp Leu Leu Gly Leu Ala Leu Cys Ala Leu Gly Gly Gly His Gly 
10 15 20 

CCG CGT CCC CCG CAC ACC TGT CCC CAG CGT CGC CTG GGA GCG CGC GAG 209 
Pro Arg Pro Pro His Thr Cys Pro Gin Arg Arg Leu Gly Ala Arg Glu 
25 30 35 

CGC CGC GAC ATG CAG CGT GAA ATC CTG GCG GTG CTC GGG CTA CCG GGA 257 
Arg Arg Asp Met Gin Arg Glu lie Leu Ala Val Leu Gly Leu Pro Glv 
^0 45 50 . 55 

CGG CCC CGA CCC CGT GCA CAA CCC GCC GCT GCC CGG CAG CCA GCG TCC 305 
Arg Pro Arg Pro Arg Ala Gin Pro Ala Ala Ala Arg Gin Pro Ala Ser 
60 65 70 



353 



GCG CCC CTC TTC ATG TTG GAC CTA TAC CAC GCC ATG ACC GAT GAC GAC 
Ala Pro Leu Phe Met Leu Asp Leu Tyr His Ma Met Thr Asp Asp Asp 
75 80 85 

GAC GGC GGG CCA CCA CAG GCT CAC TTA GGC CGT GCC GAC CTG GTC ATG 401 
Asp Gly Gly Pro Pro Gin Ala His Leu Gly Arg Ala Asp Leu Val Met 
90 95 100 

AGC TTC GTC AAC ATG GTG GAA CGC GAC CGT ACC CTG GGC TAC CAG GAG 449 
Ser Phe Val Asn Met Val Glu Arg Asp Arg Thr Leu Gly Tyr Gin Glu 
105 no 

CCA CAC TGG AAG GAA TTC CAC TTT GAC CTA ACC CAG ATC CCT GCT GGG 497 
Pro His Trp Lys Glu Phe His Phe Asp Leu Thr Gin He Pro Ala Gly 
120 125 130 135 

GAG GCT GTC ACA GCT GCT GAG TTC CGG ATC TAC AAA GAA CCC AGC ACC 545 
Glu Ala Val Thr Ala Ala Glu Phe Arg He Tyr Lys Glu Pro Ser Thr 
140 145 150 

CAC CCG CTC AAC ACA ACC CTC CAC ATC AGC ATG TTC GAA GTG GTC CAA 593 
His Pro Leu Asn Thr Thr Leu His He Ser Met Phe Glu Val Val Gin 
155 160 165 

GAG CAC TCC AAC AGG GAG TCT GAC TTG TTC TTT TTG GAT CTT CAG ACG 641 
Glu His Ser Asn Arg Glu Ser Asp Leu Phe Phe Leu Asp Leu Gin Thr 
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170 



175 



180 



CTC CGA TCT GGG GAG GAG GGC TGG CTG GTG CTG GAC ATC ACA GCA GCC 
Leu Arg Ser Gly Asp Glu Gly Trp Leu Val Leu Asp He Thr Ala Ala 
185 190 

AGT GAC CGA TGG CTG CTG AAC CAT CAC AAG GAC CTG GGA CTC CGC CTC 
Ser Asp Arg Trp Leu Leu Asn His His Lys Asp Leu Gly Leu Arg Leu 

205 210 215 

TAT GTG GAA ACC GCG GAT GGG CAC AGC ATG GAT CCT GGC CTG GCT GGT 
Tyr Val Glu Thr Ala Asp Gly His Ser Met Asp Pro Gly Leu Ala Gly 
220 225 230 

CTG CTT GGA CGA CAA GCA CCA CGC TCC AGA CAG CCT TTC ATG GTA ACC 
Leu Leu Gly Arg Gin Ala Pro Arg Ser Arg Gin Pro Phe Met Val Thr 
235 240 245 

TTC TTC AGG GCC AGC CAG AGT CCT GTG CGG GCC CCT CGG GCA GCG AGA 
Phe Phe Arg Ala Ser Gin Ser Pro Val Arg Ala Pro Arg Ala Ala Arg 
250 255 260 

CCA CTG AAG AGG AGG CAG CCA AAG AAA ACG AAC GAG CTT CCG CAC CCC 
Pro Leu Lys Arg Arg Gin Pro Lys Lys Thr Asn Glu Leu Pro His Pro 
265 270 275 

AAC AAA CTC CCA GGG ATC TTT GAT GAT GGC CAC GGT TCC CGC GGC AGA 
Asn Lys Leu Pro Gly He Phe Asp Asp Gly His Gly Ser Arg Gly Arq 
280 285 290 295 

GAG GTT TGC CGC AGG CAT GAG CTC TAG GTC AGC TTC CGT GAC CTT GGC 
Glu val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly 
300 305 310 

TGG CTG GAC TGG GTC ATC GCC CCC CAG GGC TAC TCT GCC TAT TAC TGT 
Trp Leu Asp Trp Val He Ala Pro Gin Gly Tyr Ser Ala Tyr Tyr Cys 
315 320 325 

GAG GGG GAG TGT GCT TTC CCA CTG GAC TCC TGT ATG AAC GCC ACC AAC 
Glu Gly Glu Cys Ala Phe Pro Leu Asp Ser Cys Met Asn Ala Thr Asn 
330 335 340 

CAT GCC ATC TTG CAG TCT CTG GTG CAC CTG ATG AAG CCA GAT GTT GTC 
His Ala He Leu Gin Ser Leu Val His Leu Met Lys Pro Asp Val Val 
345 350 355 

CCC AAG GCA TGC TGT GCA CCC ACC AAA CTG AGT GCC ACC TCT GTG CTG 
Pro Lys Ala Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr Ser Val Leu 
360 365 370 375 

TAC TAT GAC AGC AGC AAC AAT GTC ATC CTG CGT AAA CAC CGT AAC ATG 
Tyr Tyr Asp Ser Ser Asn Asn Val He Leu Arg Lys His Arg Asn Met 
380 385 390 

GTG GTC AAG GCC TGT GGC TGC CAC TGAGGCCCCG CCCAGCATCC TGCTTCTACT 
Val Val Lys Ala Cys Gly Cys His 
395 

ACCTTACCAT CTGGCCGGGC CCCTCTCCAG AGGCAGAAAC CCTTCTATGT TATCATAGCT 
CAGACAGGGG CAATGGGAGG CCCTTCACTT CCCCTGGCCA CTTCCTGCTA AAATTCTGGT 
CTTTCCCAGT TCCTCTGTCC TTCATGGGGT TTCGGGGCTA TCACCCCGCC CTCTCCATCC 



689 

737 

785 

833 

881 

929' 

977 

1025 

1073 

1121 

1169 

1217 

1265 

1319 

1379 
1439 
1499 
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TCCTACCCCA AGCATAGACT GAATGCACAC AGCATCCCAG AGCTATGCTA ACTCAGAGGT 1559 

CTGGGGTCAG CACTGAAGGC CCACATGAGG AAGACTGATC CTTGGCCATC CTCAGCCCAC 1619 

AATGGCAAAT TCTGGATGGT CTAAGAAGGC CCTGGAATTC TAAACTAGAT GATCTGGGCT 1679 

CTCTGCACCA TTCATTGTGG CAGTTGGGAC ATTTTTAGGT ATAACAGACA CATACACTTA 1739 

GATCAATGCA TCGCTGTACT CCTTGAAATC AGAGCTAGCT TGTTAGAAAA AGAATCAGAG 1799 

CCAGGTATAG CGGTGCATGT CATTAATCCC AGCGCTAAAG AGACAGA6AC AGGAGAATCT 1859 

CTGTGAGTTC AAGGCCACAT AGAAAGAGCC TGTCTCGGGA GCAGGAAAAA AAAAAAAAAC 1919 

GGAATTC ^^2^ 

(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Ala Met Arg Pro Gly Pro Leu Trp Leu Leu Gly Leu Ala Leu Cys 
^5 10 15 

Ala Leu Gly Gly Gly His Gly Pro Arg Pro Pro His Thr Cys Pro Gin 
20 25 30 

Arg Arg Leu Gly Ala Arg Glu Arg Arg Asp Met Gin Arg Glu He Leu 
35 40 45 

Ala Val Leu Gly Leu Pro Gly Arg Pro Arg Pro Arg Ala Gin Pro Ala 

50 55 60 

Ala Ala Arg Gin Pro Ala Ser Ala Pro Leu Phe Met Leu Asp Leu Tyr 
^5 70 75 80 

His Ala Met Thr Asp Asp Asp Asp Gly Gly Pro Pro Gin Ala His Leu 
85 90 95 

Gly Arg Ala Asp Leu Val Met Ser Phe Val Asn Met Val Glu Arg Asp 
100 105 iio 

Arg Thr Leu Gly Tyr Gin Glu Pro His Trp Lys Glu Phe His Phe Asp 
115 120 125 

Leu Thr Gin He Pro Ala Gly Glu Ala* Val Thr Ala Ala Glu Phe Arq 
130 135 140 

He Tyr Lys Glu Pro Ser Thr His Pro Leu Asn Thr Thr Leu His He 
145 150 155 160 

Ser Met Phe Glu Val Val Gin Glu His Ser Asn Arg Glu Ser Asp Leu 
165 170 175 

Phe Phe Leu Asp Leu Gin Thr Leu Arg Ser Gly Asp Glu Gly Trp Leu 
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.180 185 



190 



Val Leu Asp He Thr Ala Ala Ser Asp Arg Trp Leu Leu Asn His His 
195 200 205. 

Lys Asp Leu Gly Leu Arg Leu Tyr Val Glu Thr Ala Asp Gly His Ser 

210 215 220 

Met Asp Pro Gly Leu Ala Gly Leu Leu Gly Arg Gin Ala Pro Arg Ser 
225 230 235 ^ 240 

Arg Gin Pro Phe Met Val Thr Phe Phe Arg Ala Ser Gin Ser Pro Val 
245 250 255 

Arg Ala Pro Arg Ala Ala Arg Pro Leu Lys Arg Arg Gin Pro Lys Lvs 
260 265 270 

Thr Asn Glu Leu Pro His Pro Asn Lys Leu Pro Gly lie Phe Asp Asp 
275 280 285 

Gly His Gly Ser Arg Gly Arg Glu Val Cys Arg Arg His Glu Leu Tyr 
290 295 300 

Val Ser Phe Arg Asp Leu Gly Trp Leu Asp Trp Val lie Ala Pro Gin 

310 315 320 

Gly Tyr Ser Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asp 
325 330 335 

Ser Cys Met Asn Ala Thr Asn His Ma lie Leu Gin Ser Leu Val His 
340 345 350 

Leu Met Lys Pro Asp Val Val Pro Lys Ala Cys Cys Ala Pro Thr Lys 
355 360 365 

Leu Ser Ala Thr Ser Val Leu Tyr Tyr Asp Ser Ser Asn Asn Val lie 
370 375 380 

Leu Arg Lys His Arg Asn Met Val Val Lys Ala Cys Gly Cys His 
385 390 395 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1368 base pairs 
. (B) TYPE: nucleic acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1368 

(D) OTHER INFORMATION: /labels "6 OA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

ATG TCG GGA CTG C6A AAC ACC TCG GAG GCC GTT GCA GTG CTC GCC TCC 
Met Ser Gly Leu Arg Asn Thr Ser Glu Ala Val Ala Val Leu Ala Ser 
I S 10 15 
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CTG GGA CTC GGA ATG GTT CTG CTC ATG TTC GTG GCG ACC ACG CCG CCG 96 
Leu Gly Leu Gly Met Val Leu Leu Met Phe Val Ala Thr Thr Pro Pro 
20 25 30 

GCC GTT GAG GCC ACC CAG TCG GGG ATT TAC ATA GAC AAC GGC AAG GAC 
Ala Val Glu Ala Thr Gin Ser Gly He Tyr He Asp Asn Gly Lys Asp 
35 40 45 

CAG ACG ATC ATG CAC AGA GTG CTG AGC GAG GAC GAC AAG CTG GAC GTC 
Gin Thr He Met His Arg Val Leu Ser Glu Asp Asp Lys Leu Asp Val 
50 55 60 

TCG TAC GAG ATC CTC GAG TTC CTG GGC ATC GCC GAA CGG CCG ACG CAC 
Ser Tyr Glu He Leu Glu Phe Leu Gly He Ala Glu Arg Pro Thr His 
65 70 



75 80 



144 



192 



240 



480 



CTG AGC AGC CAC CAG TTG TCG CTG AGG AAG TCG GCT CCC AAG TTC CTG 288 
Leu Ser Ser Hxs Gin Leu Ser Leu Arg Lys Ser Ala Pro Lys Phe Leu 
85 90 95 

CTG GAC GTC TAC CAC CGC ATC ACG GCG GAG GAG GGT CTC AGC GAT CAG 336 
Leu Asp Val Tyr His Arg He Thr Ala Glu Glu Gly Leu Ser Asp Gin 
100 105 110 

GAT GAG GAC GAC GAC TAC GAA CGC GGC CAT CGG TCC AGG AGG AGC GCC 384 
Asp Glu Asp Asp Asp Tyr Glu Arg Gly His Arg Ser Arg Arg Ser Ala 
115 120 125 

GAC CTC GAG GAG GAT GAG GGC GAG CAG CAG AAG AAC TTC ATC ACC GAC 432 
Asp Leu Glu Glu Asp Glu Gly Glu Gin Gin Lys Asn Phe He Thr Asd 
130 135 140 

CTG GAC AAG CGG GCC ATC GAC GAG AGC GAC ATC ATC ATG ACC TTC CTG 
Leu Asp Lys Arg Ala He Asp Glu Ser Asp He He Met Thr Phe Leu 

150 155 160 

AAC AAG CGC CAC CAC AAT GTG GAC GAA CTG CGT CAC GAG CAC GGC CGT 528 
Asn Lys Arg His His Asn Val Asp Glu Leu Arg His Glu His Gly Arg 

165 170 175 

CGC CTG TGG TTC GAC GTC TCC AAC GTG CCC AAC GAC AAC TAC CTG GTG 576 
Arg Leu Trp Phe Asp Val Ser Asn Val Pro Asn Asp Asn Tyr Leu Val 
180 185 190 

ATG GCC GAG CTG CGC ATC TAT CAG AAC GCC AAC GAG GGC AAG TGG CTG 624 
Met Ala Glu Leu Arg He Tyr Gin Asn Ala Asn Glu Gly Lys Tro Leu 
195 200 205 

ACC GCC AAC AGG GAG TTC ACC ATC ACG GTA TAC GCC ATT GGC ACC GGC 672 
Thr Ala Asn Arg Glu Phe Thr lie Thr Val Tyr Ala He Gly Thr Gly 
210 215 220 

ACG CTG GGC CAG CAC ACC ATG GAG CCG CTG TCC TCG GTG AAC ACC ACC 720 
Thr Leu Gly Gin His Thr Met Glu Pro Leu Ser Ser Val Asn Thr Thr 
225 230 235 240 

GGG GAC TAC GTG GGC TGG TTG GAG CTC AAC GTG ACC GAG GGC CTG CAC 768 
Gly Asp Tyr Val Gly Trp Leu Glu Leu Asn Val Thr Glu Gly Leu His 
245 250 255 

GAG TGG CTG GTC AAG TCG AAG GAC AAT CAT GGC ATC TAC ATT GGA GCA 816 
Glu Trp Leu Val Lys Ser Lys Asp Asn His Gly He Tyr He Gly Ala 
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260 265 270 

CAC GCT GTC AAC CGA CCC GAC CGC GAG GTG AAG CTG GAC GAC ATT GGA 864 
His Ala Val Asn J^g Pro Asp Arg Glu Val Lys Leu Asp Asp lie Gly 
275 280 285 

CTG ATC CAC CGC AAG GTG GAC GAC GAG TTC CAG CCC TTC ATG ATC GGC 912 
Leu He His Arg Lys Val Asp Asp Glu Phe Gin Pro Phe Met He Gly 
290 295 300 

TTC TTC CGC GGA CCG GAG CTG ATC AAG GCG ACG GCC CAC AGC AGC CAC 960 
Phe Phe Arg Gly Pro Glu Leu He Lys Ala Thr Ala His Ser Ser His 
305 310 315 320 

CAC AGG AGC AAG CGA AGC GCC AGC CAT CCA CGC AAG CGC AAG AAG TCG 1008 
His Arg Ser Lys Arg Ser Ala Ser . His Pro Arg Lys Arg Lys Lys Ser 
325 330 335 

GTG TCG CCC AAC AAC GTG CCG CTG CTG GAA CCG ATG GAG AGC ACG CGC 1056 
Val Ser Pro Asn Asn Val Pro Leu Leu Glu Pro Met Glu Ser Thr Arg 
340 345 350 

AGC TGC CAG ATG CAG ACC CTG TAC ATA GAC TTC AAG GAT CTG GGC TGG 1104 
Ser Cys Gin Met Gin Thr Leu Tyr He Asp Phe Lys Asp Leu Glv Tro 
355 360 365 

CAT GAC TGG ATC ATC GCA CCA GAG GGC TAT GGC GCC TTC TAC TGC AGC 1152 
His Asp Trp He He Ala Pro Glu Gly Tyr Gly Ala Phe Tyr Cys Ser 
370 375 380 

GGC GAG TGC AAT TTC CCG CTC AAT GCG CAC ATG AAC GCC ACG AAC CAT 1200 
Gly Glu Cys Asn Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His 
385 390 395 400 

GCG ATC GTC CAG ACC CTG GTC CAC CTG CTG GAG CCC AAG AAG GTG CCC 1248 
Ala He Val Gin Thr Leu Val His Leu Leu Glu Pro Lys Lys Val Pro 
405 410. 415 

AAG CCC TGC TGC GCT CCG ACC AGG CTG GGA GCA CTA CCC GTT CTG TAC " 1296 
Lys Pro Cys Cys Ala Pro Thr Arg Leu Gly Ala Leu Pro Val Leu Tyr 
420 425 430 

CAC CTG AAC GAC GAG AAT GTG AAC CTG AAA AAG TAT AGA AAC ATG ATT 1344 
His Leu Asn Asp Glu Asn Val Asn Leu Lys Lys Tyr Arg Asn Met He 
435 440 445 

GTG AAA TCC TGC GGG TGC CAT TGA 1358 
Val Lys Ser Cys Gly Cys His 
450 455 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 455 aruino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Ser Gly Leu Arg Asn Thr Ser Glu Ala Val Ala Val Leu Ala Ser 
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10 



15 



Leu Gly Leu Gly Met Val Leu Leu Met Phe Val Ala Thr Thr Pro Pro 
20 25 30 

Ala Val Glu Ala Thr Gin Ser Gly He Tyr He Asp Asn Gly Lys Asp 
35 40 . 45 

Gin Thr He Met His Arg Val Leu Ser Glu Asp Asp Lys Leu Asp Val 
50 55 60 

Ser Tyr Glu He Leu Glu Phe Leu Gly lie Ala Glu Arg Pro Thr His 
^5 70 75 80 

Leu Ser Ser His Gin Leu Ser Leu Arg Lys Ser Ala Pro Lys Phe Leu 
85 90 95 

Leu Asp Val Tyr His Arg He Thr Ala Glu Glu Gly Leu Ser Asp. Gin 
100 105 

Asp Glu Asp Asp Asp Tyr Glu Arg Gly His Arg Ser Arg Arg Ser Ala 
115 120 125 

Asp Leu Glu Glu Asp Glu Gly Glu Gin Gin Lys Asn Phe He Thr Asp 
130 135 140 

Leu Asp Lys Arg Ala He Asp Glu Ser Asp He He Met Thr Phe Leu 
145 150 155 160 

Asn Lys Arg His His Asn Val Asp Glu Leu Arg His Glu His Gly Arg 
165 170 175 

Arg Leu Trp Phe Asp Val Ser Asn Val Pro Asn Asp Asn Tyr Leu Val 
180 185 190 

Met Ala Glu Leu Arg He Tyr Gin Asn Ala Asn Glu Gly Lys Trp Leu 
195 200 205 

Thr Ala Asn Arg Glu Phe Thr He Thr Val Tyr Ala He Gly Thr Glv 
210 215 220 

Thr Leu Gly Gin His Thr Met Glu Pro Leu Ser Ser Val Asn Thr Thr 
225 230 235 240 

Gly Asp Tyr Val Gly Trp Leu Glu Leu Asn Val Thr Glu Gly Leu His 
245 250 255 

Glu Trp Leu Val Lys Ser Lys Asp Asn His Gly He Tyr He Gly Ala 
260 265 270 

His Ala Val Asn Arg Pro Asp Arg Glu Val Lys Leu Asp Asp He Glv 
275 280 285 

Leu. He' His Arg Lys Val Asp Asp Glu Phe Gin Pro Phe Met He Gly 
290 295 300 

Phe Phe Arg Gly Pro Glu Leu He Lys Ala Thr Ala His Ser Ser His 
305 310 315 320 

His Arg Ser Lys Arg Ser Ala Ser His Pro Arg Lys Arg Lys Lys Ser 
325 330 335 

Val Ser Pro Asn Asn Val Pro Leu Leu Glu Pro Met Glu Ser Thr Arg 
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340 345 350 

Ser Cys Gin Met Gin Thr Leu Tyr He Asp Phe Lys Asp Leu Gly Trp 
355 360 365 

His Asp Trp He He Ala Pro Glu Gly Tyr Gly Ala Phe Tyr Cys Ser 
370 375 380 

Gly Glu Cys Asn Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His 
385 390 395 4OO 

Ala He Val Gin Thr Leu Val His Leu Leu Glu Pro Lys Lys Val Pro 
405 410 415 

Lys Pro Cys Cys Ala Pro Thr Arg Leu Gly Ala Leu Pro Val Leu Tyr 
420 425 430 

His Leu Asn Asp Glu Asn Val Asn Leu Lys Lys Tyr Arg Asn Met He 
435 440 445 

Val Lys Ser Cys Gly Cys His 
450 455 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1674 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 69.. 1268 

(D) OTHER INFORMATION: /note= "inOP3-PP" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

GGATCCGCGG CGCTGTCCCA TCCTTGTCGT CGAGGCGTCG CTGGATGCGA GTCCGCTAAA 60 

CGTCCGAG ATG GCT GCG CGT CCG GGA CTC CTA TGG CTA CTG GGC CTG GCT 110 
Met Ala Ala Arg Pro Gly Leu Leu Trp Leu Leu Gly Leu Ala 
15 10 

CTG TGC GTG TTG GGC GGC GGT CAC CTC TCG CAT CCC CCG CAC GTC TTT 158 
Leu Cys Val Leu Gly Gly Gly His Leu Ser His Pro Pro His Val Phe 
15 20 25 30 

CCC CAG CGT CGA CTA GGA GTA CGC GAG CCC CGC GAC ATG CAG CGC GAG 206 
Pro Gin Arg Arg Leu Gly Val Arg Glu Pro Arg Asp Met Gin Arg Glu 
35 40 .45 

ATT CGG GAG GTG CTG GGG CTA GCC GGG CGG CCC CGA TCC CGA GCA CCG 254 
He Arg Glu Val Leu Gly Leu Ala Gly Arg Pro Arg Ser Arg Ala Pro 
50 55 60 

GTC GGG GCT GCC CAG CAG CCA GCG TCT GCG CCC CTC TTT ATG TTG GAC 302 
Val Gly Ala Ala Gin Gin Pro Ala Ser Ala Pro Leu Phe Met Leu Asp 
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CTG TAG CGT GCC ATG ACG GAT GAC AGT GGC GOT GGG ACC CCG GAG CCT 350 
Leu Tyr Arg Ala Met Thr Asp Asp Ser Gly Gly Gly Thr Pro Gin Pro 
80 85 90 

CAC TTG GAC CGT GCT GAC CTG ATT ATG AGC TTT GTC AAC ATA GTG GAA 398 
His Leu Asp Arg Ala Asp Leu He Met Ser Phe Val Asn He Val Glu 
95 100 105 110 

CGC GAC CGT ACC CTG GGC TAC CAG GAG CCA CAC TGG AAG GAA TTC CAC 446 
Arg Asp Arg Thr Leu Gly Tyr Gin Glu Pro His Trp Lys Glu Phe His 

120 125 

TTT GAC CTA ACC CAG ATC CCT GCT GGG GAG GCT GTC ACA GCT GCT GAG 494 
Phe Asp Leu Thr Gin He Pro Ala Gly Glu Ala Val Thr Ala Ala Glu 
130 135 

TTC CGG ATC TAC AAA GAA CCC AGT ACC CAC CCG CTC AAC ACA ACC CTC 542 
Phe Arg He Tyr Lys Glu Pro Ser Thr His Pro Leu Asn Thr Thr Leu 
145 150 155 

CAC ATC AGC ATG TTC GAA GTG GTC CAA GAG CAC TCC AAC AGG GAG TCT 590 
His He Ser Met Phe Glu Val Val Gin Glu His Ser Asn Arg Glu Ser 
160 165 170 

GAC TTG TTC TTT TTG GAT CTT CAG ACG CTC CGA TCT GGG GAC GAG GGC 638 
Asp Leu Phe Phe Leu Asp Leu Gin Thr Leu Arg Ser Gly Asp Glu Gly 
175 180 185 190 

TGG CTG GTG CTG GAC ATC ACA GCA GCC AGT GAC CGA TGG CTG CTG AAC 686 
Trp Leu Val Leu Asp He Thr Ala Ala Ser Asp Arg Trp Leu Leu Asn 
195 200 205 

CAT CAC AAG GAC CTA GGA CTC CGC CTC TAT GTG GAA ACC GAG GAT GGG 734 
His His Lys Asp Leu Gly Leu Arg Leu Tyr Val Glu Thr Glu Asp Gly 
210 215 220 

CAC AGC ATA GAT CCT GGC CTA GCT GGT CTG CTT GGA CGA CAA GCA CCA 782 
His Ser He Asp Pro Gly Leu Ala Gly Leu Leu Gly Arg Gin Ala Pro 
225 230 235 

CGC TCC AGA CAG CCT TTC ATG GTT GGT TTC TTC AGG GCC AAC CAG AGT 830 
Arg Ser Arg Gin Pro Phe Met Val Gly Phe Phe Arg Ala Asn Gin Ser 
240 245 250 

CCT GTG CGG GCC CCT CGA ACA GCA AGA CCA CTG AAG AAG AAG CAG CTA ' 878 
Pro Val Arg Ala Pro Arg Thr Ala Arg Pro Leu Lys Lys Lys Gin Leu 
255 260 265 270 

AAT CAA ATC AAC CAG CTG CCG CAC TCC AAC AAA CAC CTA GGA ATC CTT 926 
Asn Gin He Asn Gin Leu Pro His Ser Asn Lys His Leu Gly He Leu 
275 280 285 

GAT GAT GGC CAC GGT TCT CAC GGC AGA GAA GTT TGC CGC AGG CAT GAG 974 
Asp Asp Gly His Gly Ser His Gly Arg Glu Val Cys Arg Arg His Glu 
290 295 300 

CTC TAT GTC AGC TTC CGT GAC CTT GGC TGG CTG GAC TCT GTC ATT GCC 1022 
Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Leu Asp Ser Val He Ala 
305 310 315 

CCC CAG GGC TAC TCC GCC TAT TAC TGT GCT GGG GAG TGC ATC TAC CCA 1070 
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Pro Gin Gly Tyr Ser Ala Tyx Tyx Cys Ala Gly Glu Cys He Tyr Pro 
320 325 330 

CTG AAC TCC TGT ATG AAC TCC ACC AAC CAC GCC ACT ATG CAG GCC CTG 
Leu Asn Ser Cys Met Asn Ser Thr Asn His Ala Thr Met Gin Ala Leu 
335 340 345 350 

GTA CAT CTG ATG AAG CCA GAT ATC ATC CCC AAG GTG TGC TGT GTG CCT 
Val His Leu Met Lys Pro Asp He He Pro Lys Val Cys Cys Val Pro 
355 360 365 

ACT GAG CTG AGT GCC ATT TCT CTG CTC TAC TAT GAT AGA AAC AAT AAT 
Thr Glu Leu Ser Ala He Ser Leu Leu Tyr Tyr Asp Arg Asn Asn Asn 
370 375 380 

GTC ATC CTG CGC AGG GAG CGC AAC ATG GTA GTC CAG GCC TGT GGC TGC 1262 
Val He Leu Arg Arg Glu Arg Asn Met Val Val Gin Ala Cys Gly Cys 
385 390 395 

CAC TGAGTCCCTG CCCAACAGCC TGCTGCCATC CCATCTATCT AGTCAGGCCT 13 15 
His 

400 

CTCTTCCAAG GCAGGAAACC AACAAAGAGG GAAGGCAGTG CTTTCAACTC CATGTCCACA 1375 

TTCACAGTCT TGGCCCTCTC TGTTCTTTTT GCCAAGGCTG AGAAGATGGT CCTAGTTATA 1435 

ACCCTGGTGA CCTCA6TAGC CCGATCTCTC ATCTCCCCAA ACTCCCCAAT GCAGCCAGGG 1495 

GCATCTATGT CCTTTGGGAT TGGGCACAGA AGTCCAATTT ACCAACTTAT TCATGAGTCA 1555 

CTACTGGCCC AGCCTGGACT TGAACCTGGA ACACAGGGTA GAGCTCAGGC TCTTCAGTAT 1615 

CCATCAGAAG ATTTAGGTGT GTGCAGACAT GACCACACTC CCCCTAGCAC TCCATAGCC 1674 

(2) INFORMATION FOR SEQ ID NO; 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Ala Ala Arg Pro Gly Leu Leu Trp Leu Leu Gly Leu Ala Leu Cys 
IS 10 15 

Val Leu Gly Gly Gly His Leu Ser His Pro Pro His Val Phe Pro Gin 
20 25 30 

Arg Arg Leu Gly Val Arg Glu Pro Arg Asp Met Gin Arg Glu He Arg 
35 40 45 

Glu Val Leu Gly Leu Ala Gly Arg Pro Arg Ser Arg Ala Pro Val Gly 
50 55 . 60 

Ala Ala Gin Gin Pro Ala Ser Ala Pro Leu Phe Met Leu Asp Leu Tyr 
65 70 75 80 

Arg Ala Met Thr Asp Asp Ser Gly Gly Gly Thr Pro Gin Pro His Leu 
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85 



90 



95 



Asp Arg Ala Asp Leu He Met Ser Phe Val Asn He Val Glu Arg Asd 
100 . 105 110 

Arg Thr Leu Gly Tyr Gin Glu Pro His Trp Lys Glu Phe His Phe Asp 

120 125 

Leu Thr Gin He Pro Ala Gly Glu Ala Val Thr Ala Ala Glu Phe Arg 



135 



140 



He Tyr Lys Glu Pro Ser Thr His Pro Leu Asn Thr Thr Leu His He 

150 155 160 

Ser Met Phe Glu Val Val Gin Glu His Ser Asn Arg Glu Ser Asp Leu 
165 .170 175 

Phe Phe Leu Asp Leu Gin Thr Leu Arg Ser Gly Asp Glu Gly Tro Leu 
180 185 190 . 

Val Leu Asp He Thr Ala Ala Ser Asp Arg Trp Leu Leu Asn His His 
195 200 205 

Lys Asp Leu Gly Leu Arg Leu Tyr Val Glu Thr Glu Asp Gly His Ser 
210 215 220 • 



He Asp Pro Gly Leu Ala Gly Leu Leu Gly Arg Gin Ala Pro Arg Ser 

230 235 240 

Arg Gin Pro Phe Met Val Gly Phe Phe Arg Ala Asn Gin Ser Pro Val 
245 250 255 

Arg Ala Pro Arg Thr Ala T^g Pro Leu Lys Lys Lys Gin Leu Asn Gin 
260 265 270 

He Asn Gin Leu Pro His Ser Asn Lys His Leu Gly He Leu Asp Aso 
275 280 285 

Gly His Gly Ser His Gly Arg Glu Val Cys Arg Arg His Glu Leu Tyr 
290 295 300 

Val Ser Phe Arg Asp Leu Gly Trp Leu Asp Ser Val He Ala Pro Gin 

310 315 320 

Gly Tyr Ser Ala Tyr Tyr Cys Ala Gly Glu Cys He Tyr Pro Leu Asn 
325 330 335 

Ser Cys Met Asn Ser Thr Asn His Ala Thr Met Gin Ala Leu Val His 
340 345 350 

Leu Met Lys Pro Asp He fie Pro Lys Val Cys Cys Val Pro Thr Glu 
355 360 365 

Leu Ser Ala He Ser Leu Leu Tyr Tyr Asp Arg Asn Asn Asn Val He 
370 375 380 

Leu Arg Arg Glu Arg Asn Met Val Val Gin Ala Cys Gly Cys His 
385 390 395 



<2} INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..104 

(D) OTHER INFORMATION: /not e= "BMP3" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Cys Ala Arg Arg Tyr Leu Lys Val Asp Phe Ala Asp He Gly Trp Ser 
15 10 



. 15 



Glu Trp He He Ser Pro Lys Ser Phe Asp Ala Tyr Tyr Cys Ser Gly 
20 25 30 

Ala Cys Gin Phe Pro Met Pro Lys Ser Leu Lys Pro Ser Asn His Ala 
35 40 45 

Thr He Gin Ser He Val Ala Arg Ala Val Gly Val Val Pro Glv He 
50 55 60 

Pro Glu Pro Cys Cys Val Pro Glu Lys* Met Ser Ser Leu Ser He Leu 

70 75 80 

Phe Phe Asp Glu Asn Lys Asn Val Val Leu Lys Val Tyr Pro Asn Met 
85 90 95 

Thr Val Glu Ser Cys Ala Cys Arg 
100 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /note- "BMP5" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gin 
1 5 10 15 

Asp Trp He He Ala Pro Glu Gly Tyr Ala Ala Phe Tyr Cys Asd Glv 
20 25 30 
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Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala 
35 40 45 

He Val Gin Thr Leu Val His Leu Met Phe Pro Asp His Val Pro Lys 
50 55 60 

Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala He Ser Val Leu Tyr Phe 
65 70 75 '80 

Asp Asp Ser Ser Asn Val He Leu Lys Lys Tyr Arg Asn Met Val Val 
85 90 95 

Arg Ser Cys Gly Cys His 
100 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HCMO SAPIENS 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /note= "BMP6" 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Cys Arg Lys His Glu Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp Gin 
1 5 10 15 

Asp Trp He He Ala Pro Lys Gly. Tyr Ala Ala Asn Tyr Cys Asp Gly 

20 25 30 

Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala 
35 40 45 

He Val Gin Thr Leu Val His Leu Met Asn Pro Glu Tyr Val Pro Lvs 
50 55 60 

Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala He Ser Val Leu Tyr Phe 
65 70 75 80 

Asp Asp Asn Ser Asn Val He Leu Lys Lys Tyr Arg Trp Met Val Val 
85 90 95 

Arg Ala Cys Gly Cys His 
100 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CH7U?ACTERISTICS : 

(A) LENGTH: 1247 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 
(P) TISSUE TYPE: BRAIN 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 84.. 1199 

(D) OTHER INFORMATION: /product= "GDF-1" 
/note= "GDF-1 CDNA" 



60 
110 

158 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GGGGACACCG GCCCCGCCCT CAGCCCACTC GTCCCGGGCC GCCGCGGACC CTGCGCACTC 

TCTGGTCATC GCCTGGGAGG AAG ATG CCA CCG CCG CAG CAA GGT CCC TGC 

Met Pro Pro Pro Gin Gin Gly Pro Cys 
^ 5 

GGC CAC CAC CTC CTC CTC CTC CTG GCC CTG CTG CTG CCC TCG CTG CCC 
Gly His His Leu Leu Leu Leu Leu Ala Leu Leu Leu Pro Ser Leu Pro 
10 15 20 25 

CTG ACC CGC GCC CCC GTG CCC CCA GGC CCA GCC GCC GCC CTG CTC CAG 206 
Leu Thr Arg Ala Pro Val Pro Pro Gly Pro Ala Ala Ala Leu Leu Gin 
30 35 . 40 

GCT CTA GGA CTG CGC GAT GAG CCC CAG GGT GCC CCC AGG CTC CGG CCG 254 
Ala. Leu Gly Leu Arg Asp Glu Pro Gin Gly Ala Pro Arg Leu Arg Pro 
45 50 55 

GTT CCC CCG GTC ATG TG6 CGC CTG TTT CGA CGC CGG GAC CCC CAG GAG 302 
Val Pro Pro Val. Met Trp Arg Leu Phe Arg Arg Arg Asp Pro Gin Glu 
60 65 70 

ACC AGG TCT GGC TCG CGG CGG ACG TCC CCA GGG GTC ACC CTC CAA CCG 350 
Thr Arg Ser Gly Ser Arg Arg Thr Ser Pro Gly Val Thr Leu Gin Pro 
"5 80 85 

TGC CAC GTG GAG GAG CTG GGG GTC GCC GGA AAC ATC GTG CGC CAC ATC 398 
Cys His val Glu Glu Leu Gly Val Ala Gly Asn He Val Arg His He 

95 100 105 

CCG GAC CGC GGT GCG CCC ACC CGG GCC TCG GAG CCT GTC TCG GCC GCG 446 
Pro Asp Arg Gly Ala Pro Thr Arg Ala Ser Glu Pro Val Ser Ala Ala 
110 115 120 

GGG CAT TGC CCT GAG TGG ACA GTC GTC TTC GAC CTG TCG GCT GTG GAA 4 94 

Gly His Cys Pro Glu Trp Thr Val Val Phe Asp Leu Ser Ala Val Glu 
125 130 135 

CCC GCT GAG CGC CCG AGC CGG GCC CGC CTG GAG CTG CGT TTC GCG GCG 542 
Pro Ala Glu Arg Pro Ser Arg Ala Arg Leu Glu Leu Arg Phe Ala Ala 
140 145 150 

GCG GCG GCG GCA GCC CCG GAG GGC GGC TGG GAG CTG AGC GTG GCG CAA 590 
Ala Ala Ala Ala Ala Pro Glu Gly Gly Trp Glu Leu Ser Val Ala Gin 
155 160 165 
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GCG GGC CAG GGC GCG GGC GCG GAC CCC GGG CCG GTG CTG CTC CGC CAG 63fl 
Ala Gly Gin Gly Ala Gly Ala Asp Pro Gly Pro Val Leu Leu Arg Gin 
170 175 180 185 . 

rro GTG CCC GCC CTG GGG CCG CCA GTG CGC GCG GAG CTG CTG GGC GCC 686 
Leu Val Pro Ala Leu Gly Pro Pro Val Arg Ala Glu Leu Leu Gly Ala 
190 195 200 

GCT TGG GCT CGC AAC GCC TCA TGG CCG CGC AGC CTC CGC CTG GCG CTG 
Ala Trp Ala Arg Asn Ala Ser Trp Pro Arg Ser Leu Arg Leu Ala Leu 
205 210 215 

GCG CTA CGC CCC CGG GCC CCT GCC GCC TGC GCG CGC CTG GCC GAG GCC 782 
Ala Leu Arg Pro Arg Ala Pro Ala Ala Cys Ala Arg Leu Ala Glu Ala 
220 225 230 

TCG CTG CTG CTG GTG ACC CTC GAC CCG CGC CTG TGC CAC CCC CTG GCC 830 
Ser Leu Leu Leu Val Thr Leu Asp Pro Arg Leu Cys His Pro Leu Ala 
235 240 245 

CGG CCG CGG CGC GAC GCC GAA CCC GTG TTG GGC GGC GGC CCC GGG GGC 
Arg Pro Arg Arg Asp Ala Glu Pro Val Leu Gly Gly Gly Pro Gly Gly 
250 255 260 



878 



GCT TGT CGC GCG CGG CGG • CTG TAC GTG AGC TTC CGC GAG GTG GGC TGG 926 
Ala Cys Arg Ala Arg Arg Leu Tyr Val Ser Phe Arg Glu Val Gly Trp 
270 275 ' 280 

CAC CGC TGG GTC ATC GCG CCG CGC GGC TTC CTG GCC AAC TAC TGC CAG 974 
His Arg Trp Val He Ala Pro Arg Gly Phe Leu Ala Asn Tyr Cys Gin 
285 290 295 

GGT CAG TGC GCG CTG CCC GTC GCG CTG TCG GGG TCC GGG GGG CCG CCG 1022 
Gly Gin Cys Ala Leu Pro Val Ala Leu Ser Gly Ser Gly Gly Pro Pro 
300 305 310 

GCG CTC AAC CAC GCT GTG CTG CGC GCG CTC ATG CAC GCG GCC GCC CCG 1070 
Ala Leu Asn His Ala Val Leu Arg Ala Leu Met His Ala Ala Ala Pro 
315 320 325 

GGA GCC GCC GAC CTG CCC TGC TGC GTG CCC GCG CGC CTG TCG CCC ATC 1118 
Gly Ala Ala Asp Leu Pro Cys Cys Val Pro Ala Arg Leu Ser Pro He 
330 335 340 345 

TCC GTG CTC TTC TTT GAC AAC AGC GAC AAC GTG GTG CTG CGG CAG TAT 1166 
Ser Val Leu Phe Phe Asp Asn Ser Asp Asn Val Val Leu Arg Gin Tyr 
350 355 360 

GAG GAC ATG GTG GTG GAC GAG TGC GGC TGC CGC TAACCCGGGG CGGGCAGGGA 1219 
Glu Asp Met Val Val Asp Glu Cys Gly Cys Arg 
365 370 

CCCGGGCCCA ACAATAAATG CCGCGTGG 1247 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Pro Pro Pro Gin -Gin Gly Pro Cys Gly His His Leu Leu Leu Leu 
1 5 10 15 

Leu Ala Leu Leu Leu Pro Ser Leu Pro Leu Thr Arg Ala Pro Val Pro 

20 25 . 30 

Pro Gly Pro Ala Ala Ala Leu Leu Gin Ala Leu Gly Leu Arg Asp Glu 
35 40 45 

Pro Gin Gly Ala Pro Arg Leu Arg Pro Val Pro Pro Val Met Trp Arg 

50 55 gQ 

Leu Phe Arg Arg Arg Asp Pro Gin Glu Thr Arg Ser Gly Ser Arg Arq 
" 70 75 80 

Thr Ser Pro Gly Val Thr Leu Gin Pro Cys His Val Glu Glu Leu Gly 
85 90 95 

Val Ala Gly Asn He Val Arg His He Pro Asp Arg Gly Ala Pro Thr 
100 105 110 

Arg Ala Ser Glu Pro Val Ser Ala Ala Gly His Cys Pro Glu Trp Thr 
115 120. 125 

Val Val Phe Asp Leu Ser Ala Val Glu Pro Ala Glu Arg Pro Ser Ara 
130 135 140 

Ala Arg Leu Glu Leu Arg Phe Ala Ala Ala Ala Ala Ala Ala Pro Glu 

150 155 160 

Gly Gly Trp Glu Leu Ser Val Ala Gin Ala Gly Gin Gly Ala Gly Ala 
165 170 175 

Asp Pro Gly Pro Val Leu Leu Arg Gin Leu Val Pro Ala Leu Gly Pro 
180 185 190 

Pro val Arg Ala Glu Leu Leu Gly Ala Ala Trp Ala Arg Asn Ala Ser 
195 200 205 

Trp Pro Arg Ser Leu Arg Leu Ala Leu Ala Leu Arg Pro Arg Ala Pro 
210 215 220 

Ala Ala Cys Ala T^g Leu Ala Glu Ala Ser Leu Leu Leu Val Thr Leu 
225 230 235 240 

Asp Pro Arg Leu Cys His Pro Leu Ala Arg Pro Arg Arg Asp Ala Glu 
245 250 255 

Pro Val Leu Gly Gly Gly Pro Gly Gly Ala Cys Arg Ala Arg Arg Leu 
260 265 270 

Tyr Val Ser Phe Arg Glu Val Gly Trp His Arg Trp Val He Ala Pro 
275 280 285 

T^g Gly Phe Leu Ala Asn Tyr Cys Gin Gly Gin Cys Ala Leu Pro Val 
290 295 300 

Ala Leu Ser Gly Ser Gly Gly Pro Pro Ala Leu Asn His Ala Val Leu 
305 310 315 320 
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Arg Ala Leu Met His Ala Ala Ala Pro Gly Ala Ala Asp Leu Pro Cvs 
325 330 335 

Cys val Pro Ala Arg Leu Ser Pro lie Ser Val Leu Phe Phe Asp Asn 
340 345 350 

Ser Asp Asn Val Val Leu Arg Gin Tyr Glu Asp Met Val Val Asp Glu 
355 360 365 

Cys Gly Cys Arg 
370 
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CLAIMS 

What is claimed is: 

11. A method of therapy for a mammal at risk of, or aflflicted with, loss of or damage to 

2 myocardium, the method comprising 

3 implanting a preparation of myogenic precursor cells into said mammal at a site at risk of, 

4 or afflicted with, loss of or damage to myocardium, and 
treating said myogenic precursor cells with an amount of a morphogen sufficient to 

6 promote proliferation or differentiation of said myogenic precursor cells into functional 

7 myocardium, 

1 2. A method of therapy for a mammal at risk o^ or afflicted with, loss of or damage to 

2 myocardium, the method comprising 

3 implanting a preparation of myogenic precursor cells into said mammal at a site at risk of, 

4 or afflicted with, loss of or damage to myocardium, and 

5 treating said mammal with an amount of an inducer of a morphogen encoded by a gene of 

6 said mammal, said amount being sufficient to promote proliferation or differentiation of said 

7 myogenic precursor cells mto functional myocardium. 

13. A method of therapy for a mammal at risk of, or afflicted with, loss of or damage to 

2 myocardium, the method comprising 

3 implanting a preparation of myogenic precursor cells into said mammal at a site at risk of, 

4 or afflicted with, loss of or damage to myocardium, and 
treating said myogenic precursor cells with an amount of an agonist of a morphogen 

receptor expressed by said myogenic precursor cells, said amount being sufficient to promote 
proliferation or diflferentiation of said myogenic precursor ceUs into flmctional myocardium. 

1 4. A method of therapy for a mammal at risk of, or afflicted with, loss of or damage to 

2 myocardium, the method comprising 
implanting a preparation of myogenic precursor cells into said mammal at a site at risk of, 



3 



4 or afflicted with, loss of or damage to myocardium, and 
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treating said myogenic precursor cells with an amount of a small molecule morphogenic 
activator, said amount being sufficient to promote proliferation or diflferentiation of said myogenic 
7 precursor cells into functional myocardium. 

1 5. A method as in any one of claims 1-4 wherein said myogenic precursor cells are selected 

2 from the group consisting of mammalian skeletal muscle satellite cells, embryonic myogenic 

3 precursor cells, and a histocompatible mammalian myogenic precursor cell line. 

1 6. A method as in any one of claims 1 -4 wherein said myogenic precursor cells are 

2 autologous skeletal muscle satellite cells. 

17. A method as in any one of claims 1 -4 wherein 

2 said mammal is afflicted with a condition selected from tiie group consisting of myocardial 

3 infarction and congestive heart failure. 

1 8, A method as in any one of claims 1-4 wherein said treatment step is conducted prior to 

2 said implantation step. 

1 9. A method as in any one of claims 1-4 wherein said treatment step is conducted 

2 simultaneous with said implantation step. 

1 10. A method as in any one of claims 1-4 wherein said treatment step is conducted subsequent 

2 to said implantation step. 

1 11. A method as in claim 10 wherein said treatment step is at least once a week for a period of 

2 at least four weeks. 

1 12. A metiiod as in claim 10 wherein said treatment step is at least once a montii for a period 

2 of at least one year. 

13. A method as in claim 1 wherein said morphogen treatment step is conducted with 
morphogen at a concentration of about 0.01-1000 ng/ml. 

14. A metiiod as in claim 1 wherein said morphogen treatment step is conducted witii 
morphogen at a concentration of about 0.1-100 ng/ml. 

15. A metiiod of promoting proliferation of myogenic precursor cells or differentiation of 
myogenic precursor cells into functional myocardium comprising the steps of: 
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(a) contacting said cells with a morphogen in an amount eflFective to induce said 



4 proliferation or differentiation; and 

(b) maintaining said cells in a morphogenicaUy permissive environment. 
16. A method as in claim 1 wherein said morphogen is selected from the group consisting of a 

2 pro form of a morphogen, a sohible form of a morphogen, a mature morphogen, and a C-terminal 

3 fragment of a morphogen comprising at least the seven cystdne domain of said morphogen. 

1 17. A method as in claim 1 wherein said morphogen is selected from the group consisting of 

2 osteogenic proteins and bone morphogenic proteins. 

1 18. A method as in claim 1 wherdn said morphogm 

2 induces a cascade of tissue-specific morphogenesis culminating in the formation of 

3 functional mammalian myocardium; and 

4 comprises a pair of folded polypeptides, the amino acid sequence of each of which 

5 comprises a sequence having at least 70% amino acid sequence homology with the C-terminal 

6 seven-cysteine domain of human OP-1, mouse OP-1, human OP-2 or mouse OP-2, residues 38- 

7 139 of SEQ ID NOs. 5, 6, 7 or 8, respectively. 

1 19. A method as in claim 1 wherran said morphogen is selected from the group consisting of 

2 OP-1, CBMP-2A(BMP-2), and CBMP-2B (BMP-4). 

1 20. A therapeutic composition for promoting the repair or regeneration of mammalian 

2 myocardium comprising 

3 isolated mammalian myogenic precursor cells, and 

4 an amount of a morphogen sufficient to promote proliferation or differentiation of said 

5 myogenic precursor cells into functional myocardium in a morphogenicaUy permisave 

6 environment. 



21 . A therapeutic composition for promoting the repair or regeneration of mammalian 



2 myocardium comprising 



3 



isolated mammalian myogenic precursor cells, and 
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4 an amount of an inducer of a morphogen sufficient to promote proliferation or 

5 differentiation of said myogenic precursor cells into functional myocardium in a morphogenically 

6 permissive environment. 

1 22. A therapeutic composition for promoting the repair or regeneration of mammalian 

2 myocardium comprising 

3 isolated mammalian myogenic precursor cells, and 

4 an amount of an agonist of a morphogen receptor sufficient to promote proliferation or 

5 differentiation of said myogenic precursor cells into functional myocardium in a morphogenically 

6 permissive environment. 

1 23. A therapeutic composition for promoting the repair or regeneration of mammalian 

2 myocardium comprising 

3 isolated mammalian myogenic precursor cells, and 

4 an amount of a small molecule morphogenic activator sufficient to promote proliferation 

5 or diflferentiation of said myogenic precursor cells into functional myocardium in a 

6 morphogenically permissive environment. 

1 24. A method of culturing mammalian myogenic precursor cells comprising 

2 isolatmg said myogenic precursor cells, and 

3 culturing said myogenic precursor cells in a medium comprising an amount of a 

4 morphogen sufficient to promote proliferation or differentiation of said myogenic precursor cells 

5 into functional myocardium in a morphogenically permissive environment. 

1 25 . A method of culturing mammalian myogenic precursor cells comprising 

2 isolating said myogenic precursor cells, and 

3 culturing said myogenic precursor cells in a medium comprising an amount of an inducer 

4 of a morphogen sufficient to promote proliferation or differentiation of said myogenic precursor 

5 cells into functional myocardium in a morphogenically permissive environment. 

1 26. A method of culturing mammalian myogenic precursor cells comprising 

2 isolating said myogenic precursor cells, and 
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3 

•4 
5 

1 
2 
3 
4 
5 



2 



5 



2 



culturing said myogenic precursor pells in a medium comprising an amount of an agonist 
of a morphogen receptor sufficient to promote proliferation or differentiation of said myogenic 
precursor cells into fimctional myocardium in a morphogenically permissive environment. 

27. A method of culturing mammalian nqrogenic precursor cells comprising 
isolating said myogenic precursor cells, and 

culturing said myogenic precursor cells in a medium comprising an amount of a small 
molecule morphogenic activator sufficient to promote proliferation or differentiation of said 
myogenic precursor ceUs into functional myocardium in a morpliogenicaUy permissive 



6 environment. 



1 28. 



3 of 



A method of inducing myogenic precursor cells, naturally competent to differentiate into 
skeletal or smootii muscle, to diflferentiate into cardiomyocytes. said metiiod comprising tiie steps 



4 (a) contacting said nqrogenic precursor cells with a morphogai; and 



6 cardiomyogenesis. 



(b) maintaining the product of (a) in an environment morphogenically permissive for 



1 29. 



A metiiod of producing replacement cardiomyocytes in a mammal in need thereof, said 
metiiod comprising tiie step of implanting into said mammal myogenic precursor cells induced by 
3 the method of claim 28. 

I 30. A pharmaceutical composition comprising 

a morphogenic agent selected from tfie group consisting of a morphogen, a morphogen 
inducer, an agonist of a morphogen receptor, and a small molecule moiphogenic activator; and 
a mitogen sdected from tiie group consisting of bFGF, IGF, PDGF, LIF, ACTH, MSH, 



2 
3 
4 

5 and G-CSF. 
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